..
batched-bench
tool/ex/tests: consistently free ctx, then model ( #18168 )
2025-12-22 11:00:37 +01:00
cli
server: update docs for sleeping [no ci] ( #18777 )
2026-01-12 13:01:24 +01:00
completion
server: update docs for sleeping [no ci] ( #18777 )
2026-01-12 13:01:24 +01:00
cvector-generator
common : refactor common_sampler + grammar logic changes ( #17937 )
2025-12-14 10:11:13 +02:00
export-lora
cmake : Do not install tools on iOS targets ( #15903 )
2025-09-16 09:54:44 +07:00
fit-params
llama-fit-params: free memory target per device ( #18679 )
2026-01-08 10:07:58 +01:00
gguf-split
cli: new CLI experience ( #17824 )
2025-12-10 15:28:59 +01:00
imatrix
common : refactor common_sampler + grammar logic changes ( #17937 )
2025-12-14 10:11:13 +02:00
llama-bench
llama-bench: add direct_io parameter ( #18778 )
2026-01-13 08:49:10 +01:00
mtmd
Restore clip's cb() to its rightful glory - extract common debugging elements in llama ( #17914 )
2026-01-14 20:29:35 +01:00
perplexity
common : refactor common_sampler + grammar logic changes ( #17937 )
2025-12-14 10:11:13 +02:00
quantize
quantize: prevent input/output file collision ( #18451 )
2025-12-31 23:29:03 +08:00
rpc
Install rpc-server when GGML_RPC is ON. ( #17149 )
2025-11-11 10:53:59 +00:00
server
server: improve slots scheduling for n_cmpl ( #18789 )
2026-01-15 17:10:28 +01:00
tokenize
cmake : Do not install tools on iOS targets ( #15903 )
2025-09-16 09:54:44 +07:00
tts
refactor : remove libcurl, use OpenSSL when available ( #18828 )
2026-01-14 18:02:47 +01:00
CMakeLists.txt
cmake: only build cli when server is enabled ( #18670 )
2026-01-09 16:43:26 +01:00