llama.cpp

Alberto Cabrera Pérez	c03a5a46f0	ggml-cpu: arm64: q6_K repack gemm and gemv (and generic) implementations (dotprod) (#19360)	2026-02-10 10:47:45 +00:00
* First working version of GEMM and GEMV
* interleave loads and compute
* Clang-format
* Added missing fallback. Removed tested TODO.
* Swap M and N to be consistent with the repack template convention
arm	ggml-cpu: arm64: q6_K repack gemm and gemv (and generic) implementations (dotprod) (#19360)	2026-02-10 10:47:45 +00:00
loongarch	ggml : LoongArch fixes (#16958)	2025-11-03 08:40:02 +02:00
powerpc	ggml-cpu: add mxfp4 VSX intrinsics for Power9+ (ppc64le) hardware (#15385)	2025-08-19 11:54:31 +03:00
riscv	ggml: replace hwcap with riscv_hwprobe for RVV detection (#17567)	2025-11-29 14:56:31 +02:00
s390	ggml: add s390x cpu-feats (#16774)	2025-11-02 08:48:23 +08:00
wasm	ggml-cpu : deduplicate scalar implementations (#14897)	2025-07-28 17:40:24 +02:00
x86	ggml-cpu: use LUT for converting e8->f32 scales on x86 (#19288)	2026-02-04 09:43:29 +08:00