llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git

History

Aaron Lee 6e9bafc7a7 failed attempt to implement MTP; outputs tokens but KV cache management is unreasonable		2025-08-15 23:13:56 -04:00
..
llama-cpp.h	llama : add `llama_vocab`, functions -> methods, naming (#11110 )	2025-01-12 11:32:42 +02:00
llama.h	failed attempt to implement MTP; outputs tokens but KV cache management is unreasonable	2025-08-15 23:13:56 -04:00