llama.cpp/examples/speculative
fmz 9a4bdc8c12 Introduce ggml_threadpool
Added an API to support explicit management of threadpools.
2024-05-24 12:04:04 -07:00
..
CMakeLists.txt build : link against build info instead of compiling against it (#3879) 2023-11-02 08:50:16 +02:00
README.md speculative : implement stochastic speculative sampling (#5625) 2024-03-04 20:24:00 +02:00
speculative.cpp Introduce ggml_threadpool 2024-05-24 12:04:04 -07:00

README.md

llama.cpp/examples/speculative

Demonstration of speculative decoding and tree-based speculative decoding techniques

More info: