Xuan-Son Nguyen
|
7b1db3d3b7
|
arg: clarify auto kvu/np being set on server (#17997)
* arg: clarify auto kvu/np being set on server
* improve docs
* use invalid_argument
|
2025-12-16 12:01:27 +01:00 |
Andrew Aladjev
|
4a4f7e6550
|
cli: fixed dead links to tools/main for cli and completion, fixed code owners (#17993)
Co-authored-by: Andrew Aladjev <andrew.aladjev@gmail.com>
|
2025-12-15 11:47:04 +01:00 |
Georgi Gerganov
|
254098a279
|
common : refactor common_sampler + grammar logic changes (#17937)
* common : refactor common_sampler + grammar logic changes
* tests : increase max_tokens to get needed response
* batched : fix uninitialized samplers
|
2025-12-14 10:11:13 +02:00 |
Xuan-Son Nguyen
|
34a6d86982
|
cli: enable jinja by default (#17911)
* cli: enable jinja by default
* Update common/arg.cpp
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
---------
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
|
2025-12-10 22:19:42 +01:00 |
Xuan-Son Nguyen
|
6c2131773c
|
cli: new CLI experience (#17824)
* wip
* wip
* fix logging, add display info
* handle commands
* add args
* wip
* move old cli to llama-completion
* rm deprecation notice
* move server to a shared library
* move ci to llama-completion
* add loading animation
* add --show-timings arg
* add /read command, improve LOG_ERR
* add args for speculative decoding, enable show timings by default
* add arg --image and --audio
* fix windows build
* support reasoning_content
* fix llama2c workflow
* color default is auto
* fix merge conflicts
* properly fix color problem
Co-authored-by: bandoti <bandoti@users.noreply.github.com>
* better loading spinner
* make sure to clean color on force-exit
* also clear input files on "/clear"
* simplify common_log_flush
* add warning in mtmd-cli
* implement console writter
* fix data race
* add attribute
* fix llama-completion and mtmd-cli
* add some notes about console::log
* fix compilation
---------
Co-authored-by: bandoti <bandoti@users.noreply.github.com>
|
2025-12-10 15:28:59 +01:00 |