llama.cpp/ggml/src/ggml-webgpu/wgsl-shaders
Reese Levine 647b960bd8
ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031)
* Faster tensors (#8)

Add fast matrix and matrix/vector multiplication.

* Use map for shader replacements instead of pair of strings
2025-11-07 19:27:20 -08:00
| File | Last commit | Date |
| --- | --- | --- |
| bin_op.tmpl.wgsl | ggml webgpu: support for rope,div,sub,glu,scale,cont operators (#16187) | 2025-09-30 09:57:51 -07:00 |
| binary_head.tmpl | GGML WebGPU: Support for ADD, MUL, RMS_NORM, GET_ROWS operators (#16018) | 2025-09-17 13:09:40 -07:00 |
| common_decls.tmpl | GGML WebGPU: Support for ADD, MUL, RMS_NORM, GET_ROWS operators (#16018) | 2025-09-17 13:09:40 -07:00 |
| cpy.tmpl.wgsl | ggml webgpu: support for rope,div,sub,glu,scale,cont operators (#16187) | 2025-09-30 09:57:51 -07:00 |
| embed_wgsl.py | ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031) | 2025-11-07 19:27:20 -08:00 |
| get_rows.tmpl.wgsl | ggml webgpu: support for rope,div,sub,glu,scale,cont operators (#16187) | 2025-09-30 09:57:51 -07:00 |
| glu.tmpl.wgsl | ggml webgpu: support for rope,div,sub,glu,scale,cont operators (#16187) | 2025-09-30 09:57:51 -07:00 |
| memset.wgsl | ggml WebGPU: add support for quantization types (#15440) | 2025-08-22 11:28:03 -07:00 |
| mul_mat.tmpl.wgsl | ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031) | 2025-11-07 19:27:20 -08:00 |
| mul_mat_decls.tmpl | ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031) | 2025-11-07 19:27:20 -08:00 |
| mul_mat_reg_tile.tmpl.wgsl | ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031) | 2025-11-07 19:27:20 -08:00 |
| mul_mat_subgroup_matrix.tmpl.wgsl | ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031) | 2025-11-07 19:27:20 -08:00 |
| mul_mat_vec.tmpl.wgsl | ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031) | 2025-11-07 19:27:20 -08:00 |
| rms_norm.wgsl | ggml webgpu: actually add softmax, fix rms_norm offset (#16400) | 2025-10-04 20:59:31 -07:00 |
| rope.tmpl.wgsl | model: add support for qwen3vl series (#16780) | 2025-10-30 16:19:14 +01:00 |
| scale.tmpl.wgsl | ggml webgpu: support for rope,div,sub,glu,scale,cont operators (#16187) | 2025-09-30 09:57:51 -07:00 |
| set_rows.tmpl.wgsl | ggml webgpu: minor set rows optimization (#16810) | 2025-11-05 10:27:42 +01:00 |
| soft_max.tmpl.wgsl | ggml webgpu: actually add softmax, fix rms_norm offset (#16400) | 2025-10-04 20:59:31 -07:00 |