llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git

History

Georgi Gerganov 16bcc1259d kv-cache : pad the cache size to 256 for performance (#17046 ) * kv-cache : pad the size of the small SWA cache for performance * context : pad the total context to 256 * cont : future-proof the swa pad * server : adjust test params to new logic		2025-11-07 20:03:25 +02:00
..
llama-cpp.h	llama : add `llama_vocab`, functions -> methods, naming (#11110 )	2025-01-12 11:32:42 +02:00
llama.h	kv-cache : pad the cache size to 256 for performance (#17046 )	2025-11-07 20:03:25 +02:00