llama.cpp/examples/speculative
Georgi Gerganov f07cd35da4
speculative : fix off-by-one for n_drafted
2023-10-17 11:40:26 +03:00
CMakeLists.txt   speculative : PoC for speeding-up inference via speculative sampling (#2926)   2023-09-03 15:12:08 +03:00
speculative.cpp  speculative : fix off-by-one for n_drafted                                     2023-10-17 11:40:26 +03:00