llama.cpp

Charles Xu 4d74393bcc ggml: update kleidiai to v1.13.0 (#15663 )	2025-08-31 00:03:42 +08:00
..
amx	ggml-cpu: enable IBM NNPA Vector Intrinsics (#14317 )	2025-06-25 23:49:04 +02:00
arch	ggml-cpu: Support Q5_0 and Q5_1 on s390x (#15486 )	2025-08-22 16:11:04 +08:00
cmake	ggml : build backends as libraries (#10256 )	2024-11-14 18:04:35 +01:00
kleidiai	ggml: update kleidiai to v1.13.0 (#15663 )	2025-08-31 00:03:42 +08:00
llamafile	llamafile: PowerPC Sgemm Optimization (#15558 )	2025-08-26 23:35:25 +08:00
CMakeLists.txt	ggml: update kleidiai to v1.13.0 (#15663 )	2025-08-31 00:03:42 +08:00
arch-fallback.h	ggml-cpu: Support Q5_0 and Q5_1 on s390x (#15486 )	2025-08-22 16:11:04 +08:00
binary-ops.cpp	cpu: de-duplicate some of the operators and refactor (ggml/1144)	2025-03-30 08:33:31 +03:00
binary-ops.h	cpu: de-duplicate some of the operators and refactor (ggml/1144)	2025-03-30 08:33:31 +03:00
common.h	ggml-cpu: enable IBM NNPA Vector Intrinsics (#14317 )	2025-06-25 23:49:04 +02:00
ggml-cpu-impl.h	ggml-cpu: fix invalid hsum build in debug s390x (#15634 )	2025-08-28 22:39:27 +08:00
ggml-cpu.c	ggml: add `conv3d` op (#15182 )	2025-08-22 15:33:15 +02:00
ggml-cpu.cpp	ggml : fix field name when new ggml_backend (#14944 )	2025-08-08 14:37:22 +02:00
hbm.cpp	ggml-cpu : split arch-specific implementations (#13892 )	2025-06-09 16:47:13 +02:00
hbm.h	ggml-cpu : split arch-specific implementations (#13892 )	2025-06-09 16:47:13 +02:00
ops.cpp	ggml : fix SSM_SCAN for n_groups > 1 (#15625 )	2025-08-28 10:11:36 -04:00
ops.h	ggml: add `conv3d` op (#15182 )	2025-08-22 15:33:15 +02:00
quants.c	llama : add gpt-oss (#15091 )	2025-08-05 22:10:36 +03:00
quants.h	llama : add gpt-oss (#15091 )	2025-08-05 22:10:36 +03:00
repack.cpp	ggml : repack block_iq4_nlx8 (#14904 )	2025-08-13 11:09:39 +03:00
repack.h	ggml : repack block_iq4_nlx8 (#14904 )	2025-08-13 11:09:39 +03:00
simd-mappings.h	ggml-cpu : add basic RVV support for vector f32 ops (#15057 )	2025-08-27 16:44:22 +08:00
traits.cpp	ggml : fix fallback to CPU for ununsupported ops (#15118 )	2025-08-06 14:37:35 +02:00
traits.h	ggml : fix fallback to CPU for ununsupported ops (#15118 )	2025-08-06 14:37:35 +02:00
unary-ops.cpp	cpu: de-duplicate some of the operators and refactor (ggml/1144)	2025-03-30 08:33:31 +03:00
unary-ops.h	cpu: de-duplicate some of the operators and refactor (ggml/1144)	2025-03-30 08:33:31 +03:00
vec.cpp	ggml-cpu : add basic RVV support for vector f32 ops (#15057 )	2025-08-27 16:44:22 +08:00
vec.h	ggml-cpu : add basic RVV support for vector f32 ops (#15057 )	2025-08-27 16:44:22 +08:00