llama.cpp

Ruben Ortlam 47a268ea50 Vulkan: MMVQ Integer Dot K-Quant and MUL_MAT_ID support (#16900)	2025-11-29 09:37:22 +01:00
* vulkan: split mul_mmq_funcs for mul_mat_vecq use
* add mxfp4 mmvq
* add q2_k mmvq
* add q3_k mmvq
* add q4_k and q5_k mmvq
* add q6_k mmvq
* handle 4x4 quants per mmvq thread
* enable MUL_MAT_ID mmvq support
* enable subgroup optimizations for mul_mat_vec_id shaders
* device tuning
* request prealloc_y sync after quantization
* fix indentation
* fix llvmpipe test failures
* fix mul_mat_id mmvq condition
* fix unused variable warning
cmake	cmake: fix ggml-shaders-gen compiler paths containing spaces (#12747)	2025-04-04 10:12:40 -03:00
vulkan-shaders	Vulkan: MMVQ Integer Dot K-Quant and MUL_MAT_ID support (#16900)	2025-11-29 09:37:22 +01:00
CMakeLists.txt	vulkan: Improve build time for MSVC (#16545)	2025-10-14 14:51:36 +02:00
ggml-vulkan.cpp	Vulkan: MMVQ Integer Dot K-Quant and MUL_MAT_ID support (#16900)	2025-11-29 09:37:22 +01:00