llama.cpp/ggml/src/ggml-vulkan
Jeff Bolz c9ced4910b
vulkan: preprocess mul_mat_id experts and discard workgroups more quickly (#18352)
Run a preprocess to count how many times each expert is used, and use this to
quickly discard workgroups that aren't needed.
2025-12-26 16:12:58 -06:00
..
cmake cmake: fix ggml-shaders-gen compiler paths containing spaces (#12747) 2025-04-04 10:12:40 -03:00
vulkan-shaders vulkan: preprocess mul_mat_id experts and discard workgroups more quickly (#18352) 2025-12-26 16:12:58 -06:00
CMakeLists.txt vulkan: Improve build time for MSVC (#16545) 2025-10-14 14:51:36 +02:00
ggml-vulkan.cpp vulkan: preprocess mul_mat_id experts and discard workgroups more quickly (#18352) 2025-12-26 16:12:58 -06:00