llama.cpp/ggml/src/ggml-vulkan
Ruben Ortlam 47a268ea50
Vulkan: MMVQ Integer Dot K-Quant and MUL_MAT_ID support (#16900)
* vulkan: split mul_mmq_funcs for mul_mat_vecq use

* add mxfp4 mmvq

* add q2_k mmvq

* add q3_k mmvq

* add q4_k and q5_k mmvq

* add q6_k mmvq

* handle 4x4 quants per mmvq thread

* enable MUL_MAT_ID mmvq support

* enable subgroup optimizations for mul_mat_vec_id shaders

* device tuning

* request prealloc_y sync after quantization

* fix indentation

* fix llvmpipe test failures

* fix mul_mat_id mmvq condition

* fix unused variable warning
2025-11-29 09:37:22 +01:00
..
cmake cmake: fix ggml-shaders-gen compiler paths containing spaces (#12747) 2025-04-04 10:12:40 -03:00
vulkan-shaders Vulkan: MMVQ Integer Dot K-Quant and MUL_MAT_ID support (#16900) 2025-11-29 09:37:22 +01:00
CMakeLists.txt vulkan: Improve build time for MSVC (#16545) 2025-10-14 14:51:36 +02:00
ggml-vulkan.cpp Vulkan: MMVQ Integer Dot K-Quant and MUL_MAT_ID support (#16900) 2025-11-29 09:37:22 +01:00