llama.cpp

History

Reese Levine 647b960bd8 ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031 ) * Faster tensors (#8) Add fast matrix and matrix/vector multiplication. * Use map for shader replacements instead of pair of strings		2025-11-07 19:27:20 -08:00
..
ISSUE_TEMPLATE	ggml: initial IBM zDNN backend (#14975 )	2025-08-15 21:11:22 +08:00
actions	ci : refactor sdk caching to minimize storage (#16414 )	2025-10-06 17:40:21 +02:00
workflows	ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031 )	2025-11-07 19:27:20 -08:00
copilot-instructions.md	ci : add copilot-instructions.md (#15286 )	2025-08-21 11:47:52 +02:00
labeler.yml	ci : apply model label to models (#16994 )	2025-11-04 12:29:39 +01:00
pull_request_template.md	repo : update links to new url (#11886 )	2025-02-15 16:40:57 +02:00