..
batched-bench
tool/ex/tests: consistently free ctx, then model ( #18168 )
2025-12-22 11:00:37 +01:00
cli
common : use two decimal places for float arg help messages ( #19048 )
2026-01-25 07:31:42 +01:00
completion
completion : fix prompt cache for recurrent models ( #19045 )
2026-01-25 09:12:50 +02:00
cvector-generator
common : refactor common_sampler + grammar logic changes ( #17937 )
2025-12-14 10:11:13 +02:00
export-lora
cmake : Do not install tools on iOS targets ( #15903 )
2025-09-16 09:54:44 +07:00
fit-params
llama-fit-params: keep explicit --ctx-size 0 ( #19070 )
2026-01-24 22:13:08 +01:00
gguf-split
cli: new CLI experience ( #17824 )
2025-12-10 15:28:59 +01:00
imatrix
common : refactor common_sampler + grammar logic changes ( #17937 )
2025-12-14 10:11:13 +02:00
llama-bench
Setting mmap and direct_io to false as default in llama-bench.cpp ( #18841 )
2026-01-16 09:46:51 +01:00
mtmd
mtmd: support MiniCPM-o 4.5(vision only) ( #19211 )
2026-01-30 23:19:30 +01:00
perplexity
common : refactor common_sampler + grammar logic changes ( #17937 )
2025-12-14 10:11:13 +02:00
quantize
quantize: add option --tensor-type-file to llama-quantize ( #18572 )
2026-01-31 11:39:21 +08:00
rpc
Install rpc-server when GGML_RPC is ON. ( #17149 )
2025-11-11 10:53:59 +00:00
server
server : wrap around the "id_slot" parameter ( #19207 )
2026-01-30 19:46:10 +02:00
tokenize
cmake : Do not install tools on iOS targets ( #15903 )
2025-09-16 09:54:44 +07:00
tts
refactor : remove libcurl, use OpenSSL when available ( #18828 )
2026-01-14 18:02:47 +01:00
CMakeLists.txt
cmake: only build cli when server is enabled ( #18670 )
2026-01-09 16:43:26 +01:00