llama.cpp/ggml/src/ggml-vulkan
Eve fbeda9002d
vulkan: matmul dequantization improvements (#12015)
* faster dequant for old quants

* don't use unpack for iq4_nl

* vec2 unpack for q8
2025-02-28 08:20:08 +01:00
cmake            fix: ggml: fix vulkan-shaders-gen build (#10448)      2025-01-15 14:17:42 +01:00
vulkan-shaders   vulkan: matmul dequantization improvements (#12015)   2025-02-28 08:20:08 +01:00
CMakeLists.txt   fix: ggml: fix vulkan-shaders-gen build (#10448)      2025-01-15 14:17:42 +01:00
ggml-vulkan.cpp  vulkan: fix assertion when qy_needs_dequant (#12068)  2025-02-25 16:30:21 +01:00