llama.cpp

History

Reese Levine 647b960bd8 ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031 ) * Faster tensors (#8) Add fast matrix and matrix/vector multiplication. * Use map for shader replacements instead of pair of strings		2025-11-07 19:27:20 -08:00
..
bin_op.tmpl.wgsl	ggml webgpu: support for rope,div,sub,glu,scale,cont operators (#16187 )	2025-09-30 09:57:51 -07:00
binary_head.tmpl	GGML WebGPU: Support for ADD, MUL, RMS_NORM, GET_ROWS operators (#16018 )	2025-09-17 13:09:40 -07:00
common_decls.tmpl	GGML WebGPU: Support for ADD, MUL, RMS_NORM, GET_ROWS operators (#16018 )	2025-09-17 13:09:40 -07:00
cpy.tmpl.wgsl	ggml webgpu: support for rope,div,sub,glu,scale,cont operators (#16187 )	2025-09-30 09:57:51 -07:00
embed_wgsl.py	ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031 )	2025-11-07 19:27:20 -08:00
get_rows.tmpl.wgsl	ggml webgpu: support for rope,div,sub,glu,scale,cont operators (#16187 )	2025-09-30 09:57:51 -07:00
glu.tmpl.wgsl	ggml webgpu: support for rope,div,sub,glu,scale,cont operators (#16187 )	2025-09-30 09:57:51 -07:00
memset.wgsl	ggml WebGPU: add support for quantization types (#15440 )	2025-08-22 11:28:03 -07:00
mul_mat.tmpl.wgsl	ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031 )	2025-11-07 19:27:20 -08:00
mul_mat_decls.tmpl	ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031 )	2025-11-07 19:27:20 -08:00
mul_mat_reg_tile.tmpl.wgsl	ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031 )	2025-11-07 19:27:20 -08:00
mul_mat_subgroup_matrix.tmpl.wgsl	ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031 )	2025-11-07 19:27:20 -08:00
mul_mat_vec.tmpl.wgsl	ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031 )	2025-11-07 19:27:20 -08:00
rms_norm.wgsl	ggml webgpu: actually add softmax, fix rms_norm offset (#16400 )	2025-10-04 20:59:31 -07:00
rope.tmpl.wgsl	model: add support for qwen3vl series (#16780 )	2025-10-30 16:19:14 +01:00
scale.tmpl.wgsl	ggml webgpu: support for rope,div,sub,glu,scale,cont operators (#16187 )	2025-09-30 09:57:51 -07:00
set_rows.tmpl.wgsl	ggml webgpu: minor set rows optimization (#16810 )	2025-11-05 10:27:42 +01:00
soft_max.tmpl.wgsl	ggml webgpu: actually add softmax, fix rms_norm offset (#16400 )	2025-10-04 20:59:31 -07:00