Xuan-Son Nguyen
|
6c2131773c
|
cli: new CLI experience (#17824)
* wip
* wip
* fix logging, add display info
* handle commands
* add args
* wip
* move old cli to llama-completion
* rm deprecation notice
* move server to a shared library
* move ci to llama-completion
* add loading animation
* add --show-timings arg
* add /read command, improve LOG_ERR
* add args for speculative decoding, enable show timings by default
* add arg --image and --audio
* fix windows build
* support reasoning_content
* fix llama2c workflow
* color default is auto
* fix merge conflicts
* properly fix color problem
* better loading spinner
* make sure to clean color on force-exit
* also clear input files on "/clear"
* simplify common_log_flush
* add warning in mtmd-cli
* implement console writer
* fix data race
* add attribute
* fix llama-completion and mtmd-cli
* add some notes about console::log
* fix compilation
---------
Co-authored-by: bandoti <bandoti@users.noreply.github.com>
|
2025-12-10 15:28:59 +01:00 |
Vedran Miletić
|
e9b6350e61
|
scripts : make the shell scripts cross-platform (#14341)
|
2025-06-30 10:17:18 +02:00 |
Georgi Gerganov
|
f11cfdfd7f
|
ci : use -no-cnv in gguf-split tests (#11254)
* ci : use -no-cnv in gguf-split tests
ggml-ci
* ci : use -no-cnv in requantize tests
ggml-ci
* scripts : fix [no ci]
|
2025-01-15 18:28:35 +02:00 |
ltoniazzi
|
253b7fde91
|
Fix HF repo commit to clone lora test models (#10649)
|
2024-12-04 10:45:48 +01:00 |
Xuan Son Nguyen
|
3ba780e2a8
|
lora : fix llama conversion script with ROPE_FREQS (#9117)
|
2024-08-23 12:58:53 +02:00 |
ltoniazzi
|
2339a0be1c
|
tests : add integration test for lora adapters (#8957)
* Add printing to check weights match torch version
* minor code style changes
---------
Co-authored-by: Xuan Son Nguyen <son@huggingface.co>
|
2024-08-18 11:58:04 +02:00 |