llama.cpp/examples/speculative
Georgi Gerganov cd1e937821
sampling : refactor init to use llama_sampling_params
2023-10-20 14:58:20 +03:00
..
CMakeLists.txt speculative : PoC for speeding-up inference via speculative sampling (#2926) 2023-09-03 15:12:08 +03:00
speculative.cpp sampling : refactor init to use llama_sampling_params 2023-10-20 14:58:20 +03:00