llama.cpp

History

Jeff Bolz 4cb208c93c vulkan: coopmat2 mul_mat optimizations (#14934 ) - Increase tile size for k-quants, to match non-k-quants - Choose more carefully between large and medium tiles, considering how it interacts with split_k - Allow larger/non-power of two split_k, and make the splits a multiple of 256 - Use split_k==3 to when >1/2 and <=2/3 of the SMs would hae been used		2025-08-02 11:21:37 +02:00
..
cmake	cmake : Fix BLAS link interface (ggml/1316)	2025-07-30 17:33:11 +03:00
include	ggml: Add initial WebGPU backend (#14521 )	2025-07-16 18:18:51 +03:00
src	vulkan: coopmat2 mul_mat optimizations (#14934 )	2025-08-02 11:21:37 +02:00
.gitignore	vulkan : cmake integration (#8119 )	2024-07-13 18:12:39 +02:00
CMakeLists.txt	HIP: add GGML_HIP_MMQ_MFMA option to allow disableing the MFMA path. (#14930 )	2025-07-29 17:44:30 +02:00