llama.cpp/ggml/src/ggml-cpu/arch
abhijain1204fujitsu 267ba5a1d9
ggml: aarch64: Implement SVE in Gemm q4_k 8x8 q8_k Kernel (#19132)
* Updated repack.cpp

* Updated repack.cpp

* Updated repack.cpp

* Added if condition to support only vector length 256.

* Changed the format removed comments and duplicate variable

* If SVE 256 not present then was using generic function to compute, hence slowing the performance. 

So added code if SVE 256 is not present then use NEON code.

* Code format change suggestion

---------

Co-authored-by: Vithule, Prashant <Prashant.Vithule@fujitsu.com>
2026-02-16 14:38:43 +08:00
..
arm ggml: aarch64: Implement SVE in Gemm q4_k 8x8 q8_k Kernel (#19132) 2026-02-16 14:38:43 +08:00
loongarch ggml : LoongArch fixes (#16958) 2025-11-03 08:40:02 +02:00
powerpc ggml-cpu: add mxfp4 VSX intrinsics for Power9+ (ppc64le) hardware (#15385) 2025-08-19 11:54:31 +03:00
riscv ggml: replace hwcap with riscv_hwprobe for RVV detection (#17567) 2025-11-29 14:56:31 +02:00
s390 ggml: add s390x cpu-feats (#16774) 2025-11-02 08:48:23 +08:00
wasm ggml-cpu : deduplicate scalar implementations (#14897) 2025-07-28 17:40:24 +02:00
x86 ggml-cpu: use LUT for converting e8->f32 scales on x86 (#19288) 2026-02-04 09:43:29 +08:00