llama.cpp

History

Vishal Singh f1ac84119c ggml-zendnn : add MUL_MAT_ID op support for MoE models (#21315 ) * ggml-zendnn : add MUL_MAT_ID op support for MoE models - Add MUL_MAT_ID op acceleration for Mixture-of-Experts models - MUL_MAT_ID op fallback to CPU backend if total experts > 32 - Point ZenDNN lib to latest bits ZenDNN-2026-WW13 * ggml-zendnn : add braces to sgemm failure condition for consistency Co-authored-by: Aaron Teo <taronaeo@gmail.com> --------- Co-authored-by: Aaron Teo <taronaeo@gmail.com>		2026-04-03 12:19:08 +03:00
..
VirtGPU	ggml-virtgpu: Fix some build commands (#20341 )	2026-03-12 15:47:45 +08:00
snapdragon	chore : correct typos [no ci] (#20041 )	2026-03-05 08:50:21 +01:00
BLIS.md	make : deprecate (#10514 )	2024-12-02 21:22:53 +02:00
CANN.md	CANN: update docker images to 8.5.0 and improve CANN.md (#20801 )	2026-03-27 08:53:00 +08:00
CUDA-FEDORA.md	docs: update: improve the Fedoa CUDA guide (#12536 )	2025-03-24 11:02:26 +00:00
OPENCL.md	docs: add linux to index (#18907 )	2026-01-18 18:03:35 +08:00
OPENVINO.md	docs : Update OpenVINO backend docs (#20968 )	2026-03-25 10:33:51 +02:00
SYCL.md	[SYCL] Update SYCL.md for binary package for Windows (#20401 )	2026-03-11 22:21:22 +08:00
VirtGPU.md	ggml-virtgpu: improve the reliability of the code (#19846 )	2026-02-26 20:00:57 +08:00
ZenDNN.md	ggml-zendnn : add MUL_MAT_ID op support for MoE models (#21315 )	2026-04-03 12:19:08 +03:00
zDNN.md	ggml-zendnn : add ZenDNN backend for AMD CPUs (#17690 )	2025-12-07 00:13:33 +08:00