llama.cpp/docs/ops
Vishal Singh f1ac84119c
ggml-zendnn : add MUL_MAT_ID op support for MoE models (#21315)
* ggml-zendnn : add MUL_MAT_ID op support for MoE models
- Add MUL_MAT_ID op acceleration for Mixture-of-Experts models
- MUL_MAT_ID op fallback to CPU backend if total experts > 32
- Point ZenDNN lib to latest bits ZenDNN-2026-WW13

* ggml-zendnn : add braces to sgemm failure condition for consistency

Co-authored-by: Aaron Teo <taronaeo@gmail.com>

---------

Co-authored-by: Aaron Teo <taronaeo@gmail.com>
2026-04-03 12:19:08 +03:00
..
BLAS.csv docs(ggml): update backend ops (#18734) 2026-01-10 18:48:17 +08:00
CANN.csv docs : update ops.md for CANN backend (#18654) 2026-01-16 13:32:17 +01:00
CPU.csv docs: update CPU backend ops to mark POOL_1D as supported (#20304) 2026-03-10 21:31:24 +08:00
CUDA.csv
Metal.csv docs : fix Metal backend op support status in ops.md (#20779) 2026-03-20 11:06:38 +02:00
OpenCL.csv docs : update opencl ops (#17904) 2025-12-10 15:20:00 +01:00
SYCL.csv [SYCL] ehance UPSCALE to support all UT cases (#20637) 2026-03-17 10:01:52 +08:00
Vulkan.csv ggml-vulkan: add SGN operator, auto-generate Vulkan.csv and ops.md (#20219) 2026-03-09 07:24:16 +01:00
WebGPU.csv ggml webgpu: ops support for qwen3.5 (SET, TRI_SOLVE, SSM_CONV, GATED_DELTA_NET) + GET_ROWS optimization (#20687) 2026-03-19 08:45:28 -07:00
ZenDNN.csv ggml-zendnn : add MUL_MAT_ID op support for MoE models (#21315) 2026-04-03 12:19:08 +03:00
zDNN.csv docs(ggml): update backend ops (#18734) 2026-01-10 18:48:17 +08:00