llama.cpp/tools
Radoslav Gerganov 2b6b55a59f
server : include usage statistics only when user request them (#16052)
* server : include usage statistics only when user request them

When serving the OpenAI compatible API, we should check if
{"stream_options": {"include_usage": true} is set in the request when
deciding whether we should send usage statistics

closes: #16048

* add unit test
2025-09-18 10:36:57 +00:00
..
batched-bench cmake : Do not install tools on iOS targets (#15903) 2025-09-16 09:54:44 +07:00
cvector-generator cmake : Do not install tools on iOS targets (#15903) 2025-09-16 09:54:44 +07:00
export-lora cmake : Do not install tools on iOS targets (#15903) 2025-09-16 09:54:44 +07:00
gguf-split cmake : Do not install tools on iOS targets (#15903) 2025-09-16 09:54:44 +07:00
imatrix cmake : Do not install tools on iOS targets (#15903) 2025-09-16 09:54:44 +07:00
llama-bench llama-bench: add --n-cpu-moe support (#15952) 2025-09-16 16:17:08 +02:00
main cmake : Do not install tools on iOS targets (#15903) 2025-09-16 09:54:44 +07:00
mtmd cmake : Do not install tools on iOS targets (#15903) 2025-09-16 09:54:44 +07:00
perplexity cmake : Do not install tools on iOS targets (#15903) 2025-09-16 09:54:44 +07:00
quantize cmake : Do not install tools on iOS targets (#15903) 2025-09-16 09:54:44 +07:00
rpc rpc : fix regression when --device is used (#15981) 2025-09-14 12:28:18 +03:00
run cmake : Do not install tools on iOS targets (#15903) 2025-09-16 09:54:44 +07:00
server server : include usage statistics only when user request them (#16052) 2025-09-18 10:36:57 +00:00
tokenize cmake : Do not install tools on iOS targets (#15903) 2025-09-16 09:54:44 +07:00
tts cmake : Do not install tools on iOS targets (#15903) 2025-09-16 09:54:44 +07:00
CMakeLists.txt mtmd : rename llava directory to mtmd (#13311) 2025-05-05 16:02:55 +02:00