llama.cpp

Xuan-Son Nguyen ddcb75dd8a server: add auto-sleep after N seconds of idle (#18228)	2025-12-21 02:24:42 +01:00
* implement sleeping at queue level
* implement server-context suspend
* add test
* add docs
* optimization: add fast path
* make sure to free llama_init
* nits
* fix use-after-free
* allow /models to be accessed during sleeping, fix use-after-free
* don't allow accessing /models during sleep, it is not thread-safe
* fix data race on accessing props and model_meta
* small clean up
* trailing whitespace
* rm outdated comments

CMakeLists.txt	cli: new CLI experience (#17824)	2025-12-10 15:28:59 +01:00
README.md	cli: fixed dead links to tools/main for cli and completion, fixed code owners (#17993)	2025-12-15 11:47:04 +01:00
cli.cpp	server: add auto-sleep after N seconds of idle (#18228)	2025-12-21 02:24:42 +01:00

TODO