batched-bench
tool/ex/tests: consistently free ctx, then model (#18168)
2025-12-22 11:00:37 +01:00
cli
server: update docs for sleeping [no ci] (#18777)
2026-01-12 13:01:24 +01:00
completion
server: update docs for sleeping [no ci] (#18777)
2026-01-12 13:01:24 +01:00
cvector-generator
common : refactor common_sampler + grammar logic changes (#17937)
2025-12-14 10:11:13 +02:00
export-lora
cmake : Do not install tools on iOS targets (#15903)
2025-09-16 09:54:44 +07:00
fit-params
llama-fit-params: free memory target per device (#18679)
2026-01-08 10:07:58 +01:00
gguf-split
cli: new CLI experience (#17824)
2025-12-10 15:28:59 +01:00
imatrix
common : refactor common_sampler + grammar logic changes (#17937)
2025-12-14 10:11:13 +02:00
llama-bench
llama-bench: add direct_io parameter (#18778)
2026-01-13 08:49:10 +01:00
mtmd
mtmd: fix use_non_causal being reported incorrectly (#18793)
2026-01-13 12:19:38 +01:00
perplexity
common : refactor common_sampler + grammar logic changes (#17937)
2025-12-14 10:11:13 +02:00
quantize
quantize: prevent input/output file collision (#18451)
2025-12-31 23:29:03 +08:00
rpc
Install rpc-server when GGML_RPC is ON. (#17149)
2025-11-11 10:53:59 +00:00
server
feat: Simplify MCP server enabling logic per chat
2026-01-19 16:43:53 +01:00
tokenize
cmake : Do not install tools on iOS targets (#15903)
2025-09-16 09:54:44 +07:00
tts
common : refactor common_sampler + grammar logic changes (#17937)
2025-12-14 10:11:13 +02:00
CMakeLists.txt
cmake: only build cli when server is enabled (#18670)
2026-01-09 16:43:26 +01:00