llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git

History

Johannes Gäßler 92b8810ec7 CUDA: skip masked KV slices for all FA kernels (#14924 )		2025-07-30 15:46:13 +02:00
..
cmake	cmake : Indent ggml-config.cmake (ggml/1310)	2025-07-28 08:15:01 +03:00
include	ggml: Add initial WebGPU backend (#14521 )	2025-07-16 18:18:51 +03:00
src	CUDA: skip masked KV slices for all FA kernels (#14924 )	2025-07-30 15:46:13 +02:00
.gitignore	vulkan : cmake integration (#8119 )	2024-07-13 18:12:39 +02:00
CMakeLists.txt	HIP: add GGML_HIP_MMQ_MFMA option to allow disableing the MFMA path. (#14930 )	2025-07-29 17:44:30 +02:00