llama.cpp/ggml
Johannes Gäßler 92b8810ec7
CUDA: skip masked KV slices for all FA kernels (#14924)
2025-07-30 15:46:13 +02:00
..
cmake cmake : Indent ggml-config.cmake (ggml/1310) 2025-07-28 08:15:01 +03:00
include ggml: Add initial WebGPU backend (#14521) 2025-07-16 18:18:51 +03:00
src CUDA: skip masked KV slices for all FA kernels (#14924) 2025-07-30 15:46:13 +02:00
.gitignore vulkan : cmake integration (#8119) 2024-07-13 18:12:39 +02:00
CMakeLists.txt HIP: add GGML_HIP_MMQ_MFMA option to allow disableing the MFMA path. (#14930) 2025-07-29 17:44:30 +02:00