llama.cpp

History

Daniel Bevenius 5ea3be265b cuda : fix top-k compilation when CUB is unavailable This commit adds a macro guard around argsort_f32_i32_cuda_cub usage in the top-k fallback path, falling back to bitonic sort when GGML_CUDA_USE_CUB is not defined. The motivation for this is that some environments like AMD HIP do not have CUB available, causing compilation failure. Refs: https://github.com/ggml-org/llama.cpp/actions/runs/19728226426/job/56523606840#step:6:208		2025-11-27 09:40:13 +01:00
..
cmake	ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (#15094 )	2025-08-07 13:45:41 +02:00
include	ggml : add ggml_top_k (#17365 )	2025-11-25 15:31:43 +02:00
src	cuda : fix top-k compilation when CUB is unavailable	2025-11-27 09:40:13 +01:00
.gitignore	vulkan : cmake integration (#8119 )	2024-07-13 18:12:39 +02:00
CMakeLists.txt	ggml : remove dirty flag from version string (ggml/1391)	2025-11-24 15:26:31 +02:00