llama.cpp/ggml/src/ggml-vulkan/vulkan-shaders
Ruben Ortlam 635ef78ec5
vulkan: work around Intel fp16 bug in mmq (#18814)
2026-01-14 09:41:23 +01:00
..
feature-tests vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
CMakeLists.txt vulkan: Fix GGML_VULKAN_SHADER_DEBUG_INFO (#14427) 2025-06-27 22:35:30 -05:00
abs.comp vulkan: implement ABS and NEG (#17245) 2025-11-15 12:00:29 +01:00
acc.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
add.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
add1.comp vulkan: implement ADD1, ARANGE, FILL, SOFTPLUS, STEP, ROUND, CEIL, FLOOR, TRUNC (#17319) 2025-11-19 17:29:45 +01:00
add_id.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
arange.comp vulkan: implement ADD1, ARANGE, FILL, SOFTPLUS, STEP, ROUND, CEIL, FLOOR, TRUNC (#17319) 2025-11-19 17:29:45 +01:00
argmax.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
argsort.comp vulkan: support larger argsort (#17313) 2025-11-19 17:25:50 +01:00
argsort_large.comp vulkan: support larger argsort (#17313) 2025-11-19 17:25:50 +01:00
ceil.comp vulkan: implement ADD1, ARANGE, FILL, SOFTPLUS, STEP, ROUND, CEIL, FLOOR, TRUNC (#17319) 2025-11-19 17:29:45 +01:00
clamp.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
concat.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
contig_copy.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
conv2d_dw.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
conv2d_mm.comp vulkan : support conv-2d with large output size (#17685) 2025-12-05 21:46:39 +01:00
conv_transpose_1d.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
copy.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
copy_from_quant.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
copy_to_quant.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
copy_transpose.comp vulkan: Add copy_transpose shader (#17371) 2025-11-19 16:50:43 +01:00
cos.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
count_equal.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
count_experts.comp vulkan: preprocess mul_mat_id experts and discard workgroups more quickly (#18352) 2025-12-26 16:12:58 -06:00
cumsum.comp vulkan: Optimize GGML_OP_CUMSUM (#18417) 2026-01-02 15:32:30 -06:00
cumsum_multipass1.comp vulkan: Optimize GGML_OP_CUMSUM (#18417) 2026-01-02 15:32:30 -06:00
cumsum_multipass2.comp vulkan: Optimize GGML_OP_CUMSUM (#18417) 2026-01-02 15:32:30 -06:00
dequant_f32.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
dequant_funcs.glsl vulkan: more mul mat optimizations (#18533) 2026-01-07 11:13:17 +01:00
dequant_funcs_cm2.glsl Vulkan MMQ Integer Dot Refactor and K-Quant support (#16536) 2025-10-29 14:39:03 +01:00
dequant_head.glsl vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
dequant_iq1_m.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
dequant_iq1_s.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
dequant_iq2_s.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
dequant_iq2_xs.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
dequant_iq2_xxs.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
dequant_iq3_s.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
dequant_iq3_xxs.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
dequant_iq4_nl.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
dequant_iq4_xs.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
dequant_mxfp4.comp Vulkan MMQ Integer Dot Refactor and K-Quant support (#16536) 2025-10-29 14:39:03 +01:00
dequant_q2_k.comp Vulkan MMQ Integer Dot Refactor and K-Quant support (#16536) 2025-10-29 14:39:03 +01:00
dequant_q3_k.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
dequant_q4_0.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
dequant_q4_1.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
dequant_q4_k.comp Vulkan MMQ Integer Dot Refactor and K-Quant support (#16536) 2025-10-29 14:39:03 +01:00
dequant_q5_0.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
dequant_q5_1.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
dequant_q5_k.comp Vulkan MMQ Integer Dot Refactor and K-Quant support (#16536) 2025-10-29 14:39:03 +01:00
dequant_q6_k.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
dequant_q8_0.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
diag.comp vulkan: support GGML_OP_DIAG (#17893) 2025-12-13 10:07:49 +01:00
diag_mask_inf.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
div.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
exp.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
fill.comp vulkan: implement ADD1, ARANGE, FILL, SOFTPLUS, STEP, ROUND, CEIL, FLOOR, TRUNC (#17319) 2025-11-19 17:29:45 +01:00
flash_attn.comp vulkan: Fix data race/hang in scalar/cm1 flash attention (#17887) 2025-12-14 09:00:00 +01:00
flash_attn_base.glsl vulkan: Support FA with K/V in F32 (#16543) 2025-10-14 15:53:37 +02:00
flash_attn_cm1.comp vulkan: Fix data race/hang in scalar/cm1 flash attention (#17887) 2025-12-14 09:00:00 +01:00
flash_attn_cm2.comp vulkan : fix FA mask load with bounds check (coopmat2) (#17606) 2025-11-30 01:03:21 +01:00
flash_attn_split_k_reduce.comp vulkan: Handle FA with all -inf mask values (#16447) 2025-10-20 22:16:08 -05:00
floor.comp vulkan: implement ADD1, ARANGE, FILL, SOFTPLUS, STEP, ROUND, CEIL, FLOOR, TRUNC (#17319) 2025-11-19 17:29:45 +01:00
geglu.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
geglu_erf.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
geglu_quick.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
gelu.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
gelu_erf.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
gelu_quick.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
generic_binary_head.glsl Vulkan: MMVQ Integer Dot K-Quant and MUL_MAT_ID support (#16900) 2025-11-29 09:37:22 +01:00
generic_head.glsl vulkan: support GGML_UNARY_OP_XIELU (#18062) 2025-12-21 10:17:58 +01:00
generic_unary_head.glsl Vulkan: MMVQ Integer Dot K-Quant and MUL_MAT_ID support (#16900) 2025-11-29 09:37:22 +01:00
get_rows.comp vulkan: support get_rows for i32 (#17941) 2025-12-13 10:12:53 +01:00
get_rows_quant.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
glu_head.glsl vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
glu_main.glsl vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
group_norm.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
hardsigmoid.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
hardswish.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
im2col.comp vulkan: fix im2col overflowing maxworkgroupcount (#18180) 2025-12-21 10:32:58 +01:00
im2col_3d.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
l2_norm.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
leaky_relu.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
log.comp vulkan: add log RTE support to fix Nvidia CI (#17320) 2025-11-17 14:37:49 -06:00
mul.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
mul_mat_split_k_reduce.comp vulkan: optimize and reenable split_k (#10637) 2024-12-03 20:29:54 +01:00
mul_mat_vec.comp vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) (#18678) 2026-01-12 12:32:13 +01:00
mul_mat_vec_base.glsl vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) (#18678) 2026-01-12 12:32:13 +01:00
mul_mat_vec_iface.glsl Vulkan: MMVQ Integer Dot K-Quant and MUL_MAT_ID support (#16900) 2025-11-29 09:37:22 +01:00
mul_mat_vec_iq1_m.comp vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) (#18678) 2026-01-12 12:32:13 +01:00
mul_mat_vec_iq1_s.comp vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) (#18678) 2026-01-12 12:32:13 +01:00
mul_mat_vec_iq2_s.comp vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) (#18678) 2026-01-12 12:32:13 +01:00
mul_mat_vec_iq2_xs.comp vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) (#18678) 2026-01-12 12:32:13 +01:00
mul_mat_vec_iq2_xxs.comp vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) (#18678) 2026-01-12 12:32:13 +01:00
mul_mat_vec_iq3_s.comp vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) (#18678) 2026-01-12 12:32:13 +01:00
mul_mat_vec_iq3_xxs.comp vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) (#18678) 2026-01-12 12:32:13 +01:00
mul_mat_vec_nc.comp vulkan: Fuse mul_mat_id+add_id+mul and mul_mat+add+add. (#17287) 2025-11-15 19:54:23 +01:00
mul_mat_vec_p021.comp vulkan: Fuse mul_mat_id+add_id+mul and mul_mat+add+add. (#17287) 2025-11-15 19:54:23 +01:00
mul_mat_vec_q2_k.comp vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) (#18678) 2026-01-12 12:32:13 +01:00
mul_mat_vec_q3_k.comp vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) (#18678) 2026-01-12 12:32:13 +01:00
mul_mat_vec_q4_k.comp vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) (#18678) 2026-01-12 12:32:13 +01:00
mul_mat_vec_q5_k.comp vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) (#18678) 2026-01-12 12:32:13 +01:00
mul_mat_vec_q6_k.comp vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) (#18678) 2026-01-12 12:32:13 +01:00
mul_mat_vecq.comp vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) (#18678) 2026-01-12 12:32:13 +01:00
mul_mat_vecq_funcs.glsl vulkan: Implement mmvq for iq1_s/iq1_m (#18450) 2026-01-02 20:19:04 +01:00
mul_mm.comp vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) (#18678) 2026-01-12 12:32:13 +01:00
mul_mm_cm2.comp vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) (#18678) 2026-01-12 12:32:13 +01:00
mul_mm_funcs.glsl vulkan: more mul mat optimizations (#18533) 2026-01-07 11:13:17 +01:00
mul_mm_id_funcs.glsl vulkan: preprocess mul_mat_id experts and discard workgroups more quickly (#18352) 2025-12-26 16:12:58 -06:00
mul_mmq.comp vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) (#18678) 2026-01-12 12:32:13 +01:00
mul_mmq_funcs.glsl vulkan: work around Intel fp16 bug in mmq (#18814) 2026-01-14 09:41:23 +01:00
mul_mmq_shmem_types.glsl vulkan: fix shmem overrun in mmq id shader (#16873) 2025-10-31 08:14:49 +01:00
multi_add.comp vulkan: Fix multi_add invalid descriptor usage (#16899) 2025-11-01 06:52:14 +01:00
neg.comp vulkan: implement ABS and NEG (#17245) 2025-11-15 12:00:29 +01:00
norm.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
opt_step_adamw.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
opt_step_sgd.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
pad.comp ggml : add circular tiling support to pad, for Vulkan, CUDA, and CPU (used for making seamless textures) (#16985) 2025-12-06 15:07:02 +01:00
pool2d.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
quantize_q8_1.comp vulkan: handle quantize_q8_1 overflowing the max workgroup count (#18515) 2026-01-05 11:30:14 +01:00
reglu.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
relu.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
repeat.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
repeat_back.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
rms_norm.comp vulkan: add more num_blocks instantiations in rms_norm (#17701) 2025-12-05 22:08:56 +01:00
rms_norm_back.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
rms_norm_partials.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
roll.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
rope_funcs.glsl vulkan: Extend rope fusions to allow mrope (#18264) 2025-12-22 11:03:13 -06:00
rope_head.glsl vulkan: fuse rms_norm + mul + rope (+ view + set_rows) (#16977) 2025-11-08 08:52:15 +01:00
rope_multi.comp vulkan: handle rope with large number of rows (#18306) 2025-12-26 16:53:46 +01:00
rope_neox.comp vulkan: handle rope with large number of rows (#18306) 2025-12-26 16:53:46 +01:00
rope_norm.comp vulkan: handle rope with large number of rows (#18306) 2025-12-26 16:53:46 +01:00
rope_params.glsl vulkan: handle rope with large number of rows (#18306) 2025-12-26 16:53:46 +01:00
rope_vision.comp vulkan: handle rope with large number of rows (#18306) 2025-12-26 16:53:46 +01:00
round.comp vulkan: implement ADD1, ARANGE, FILL, SOFTPLUS, STEP, ROUND, CEIL, FLOOR, TRUNC (#17319) 2025-11-19 17:29:45 +01:00
rte.glsl vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
scale.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
sigmoid.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
silu.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
silu_back.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
sin.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
soft_max.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
soft_max_back.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
soft_max_large1.comp vulkan: Multi-pass softmax for large number of cols (#17892) 2025-12-13 10:04:29 +01:00
soft_max_large2.comp vulkan: Multi-pass softmax for large number of cols (#17892) 2025-12-13 10:04:29 +01:00
soft_max_large3.comp vulkan: Multi-pass softmax for large number of cols (#17892) 2025-12-13 10:04:29 +01:00
soft_max_large_common.glsl vulkan: Multi-pass softmax for large number of cols (#17892) 2025-12-13 10:04:29 +01:00
softplus.comp vulkan: implement ADD1, ARANGE, FILL, SOFTPLUS, STEP, ROUND, CEIL, FLOOR, TRUNC (#17319) 2025-11-19 17:29:45 +01:00
solve_tri.comp vulkan: support solve_tri with larger N/K values (#17781) 2025-12-06 08:56:45 +01:00
sqrt.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
square.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
ssm_conv.comp vulkan: Add State Space Model (SSM) Operations Support (#16463) 2025-10-17 14:23:47 +02:00
ssm_scan.comp vulkan: optimize ssm_scan (#18630) 2026-01-08 15:16:54 +01:00
step.comp vulkan: implement ADD1, ARANGE, FILL, SOFTPLUS, STEP, ROUND, CEIL, FLOOR, TRUNC (#17319) 2025-11-19 17:29:45 +01:00
sub.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
sum_rows.comp vulkan: Implement GGML_OP_CUMSUM (#17479) 2025-11-26 07:08:10 +01:00
sum_rows.glsl vulkan: Implement GGML_OP_CUMSUM (#17479) 2025-11-26 07:08:10 +01:00
swiglu.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
swiglu_oai.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
tanh.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
timestep_embedding.comp vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
topk_argsort.comp vulkan: Reduce temporary memory usage for TOP_K (#17623) 2025-12-02 19:22:04 +01:00
topk_moe.comp vulkan: fix topk_moe_sigmoid_norm_bias failures in GLM-4.6 (#18582) 2026-01-05 11:51:39 +01:00
topk_nary_search.comp vulkan: fix top_k bug when there are ties in the input (#17659) 2025-12-05 22:03:19 +01:00
tri.comp vulkan: Implement GGML_OP_TRI (#17503) 2025-11-28 10:07:29 +01:00
trunc.comp vulkan: implement ADD1, ARANGE, FILL, SOFTPLUS, STEP, ROUND, CEIL, FLOOR, TRUNC (#17319) 2025-11-19 17:29:45 +01:00
types.glsl vulkan: Implement mmvq for iq1_s/iq1_m (#18450) 2026-01-02 20:19:04 +01:00
upscale.comp vulkan: Support UPSCALE w/antialias (#18327) 2025-12-26 17:00:57 +01:00
utils.glsl vulkan : incremental shader builds (#16341) 2025-10-04 11:42:56 +02:00
vulkan-shaders-gen.cpp vulkan: more mul mat optimizations (#18533) 2026-01-07 11:13:17 +01:00
wkv6.comp rwkv6: add wkv6 support for Vulkan backend (#10829) 2024-12-16 22:00:46 +01:00
wkv7.comp llama: Add support for RWKV v7 architecture (#12412) 2025-03-18 07:27:50 +08:00
xielu.comp vulkan: support GGML_UNARY_OP_XIELU (#18062) 2025-12-21 10:17:58 +01:00