llama.cpp

History

Akarshan 121e192865 server: add optional POST /exit endpoint for graceful shutdown - Introduce --endpoint-exit flag and LLAMA_ARG_ENDPOINT_EXIT env var - Add endpoint_exit to common_params (disabled by default) - Implement POST /exit with explicit confirmation token to prevent misuse - Support graceful shutdown via injected on_shutdown callback - Handle both router and non-router server shutdown paths		2025-12-16 14:25:26 +05:30
..
batched-bench	batched-bench : add "separate text gen" mode (#17103 )	2025-11-10 12:59:29 +02:00
cli	cli: fixed dead links to tools/main for cli and completion, fixed code owners (#17993 )	2025-12-15 11:47:04 +01:00
completion	cli: fixed dead links to tools/main for cli and completion, fixed code owners (#17993 )	2025-12-15 11:47:04 +01:00
cvector-generator	common : refactor common_sampler + grammar logic changes (#17937 )	2025-12-14 10:11:13 +02:00
export-lora	cmake : Do not install tools on iOS targets (#15903 )	2025-09-16 09:54:44 +07:00
fit-params	llama: automatically set parameters not set by the user in such a way that maximizes GPU utilization (#16653 )	2025-12-15 09:24:59 +01:00
gguf-split	cli: new CLI experience (#17824 )	2025-12-10 15:28:59 +01:00
imatrix	common : refactor common_sampler + grammar logic changes (#17937 )	2025-12-14 10:11:13 +02:00
llama-bench	cli: fixed dead links to tools/main for cli and completion, fixed code owners (#17993 )	2025-12-15 11:47:04 +01:00
mtmd	mtmd: refactor audio preprocessing (#17978 )	2025-12-15 14:16:52 +01:00
perplexity	common : refactor common_sampler + grammar logic changes (#17937 )	2025-12-14 10:11:13 +02:00
quantize	cli: new CLI experience (#17824 )	2025-12-10 15:28:59 +01:00
rpc	Install rpc-server when GGML_RPC is ON. (#17149 )	2025-11-11 10:53:59 +00:00
run	Manually link -lbsd to resolve flock symbol on AIX (#16610 )	2025-10-23 19:37:31 +08:00
server	server: add optional POST /exit endpoint for graceful shutdown	2025-12-16 14:25:26 +05:30
tokenize	cmake : Do not install tools on iOS targets (#15903 )	2025-09-16 09:54:44 +07:00
tts	common : refactor common_sampler + grammar logic changes (#17937 )	2025-12-14 10:11:13 +02:00
CMakeLists.txt	llama: automatically set parameters not set by the user in such a way that maximizes GPU utilization (#16653 )	2025-12-15 09:24:59 +01:00