llama.cpp/ggml/src/ggml-cpu
junchao-zhao aa719c2f88 ggml : fix loongarch lsx compilation error (#15864) 2025-09-25 12:22:55 +03:00
amx ggml-amx : fix ggml_amx_init() on generic Linux (#16049) 2025-09-18 23:07:26 +02:00
arch ggml : fix loongarch lsx compilation error (#15864) 2025-09-25 12:22:55 +03:00
cmake ggml : build backends as libraries (#10256) 2024-11-14 18:04:35 +01:00
kleidiai kleidiai: fix GGML_ASSERT(*cur_backend_id != -1) failed (#15614) 2025-09-11 12:45:40 +02:00
llamafile llamafile: PowerPC Sgemm Optimization (#15558) 2025-08-26 23:35:25 +08:00
CMakeLists.txt ggml-cpu : add check for ARM MATMUL_INT8/i8mm support (#15922) 2025-09-11 14:39:12 +01:00
arch-fallback.h ggml-cpu: Support Q5_0 and Q5_1 on s390x (#15486) 2025-08-22 16:11:04 +08:00
binary-ops.cpp cpu: de-duplicate some of the operators and refactor (ggml/1144) 2025-03-30 08:33:31 +03:00
binary-ops.h cpu: de-duplicate some of the operators and refactor (ggml/1144) 2025-03-30 08:33:31 +03:00
common.h ggml : refactor forward_dup for cpu backend (#16062) 2025-09-19 06:31:56 +02:00
ggml-cpu-impl.h ggml-cpu: clean up s390x SIMD (#15855) 2025-09-08 02:18:28 +08:00
ggml-cpu.c ggml-cpu: Respect cpumask settings (#16164) 2025-09-23 11:58:12 +03:00
ggml-cpu.cpp rename optimize_graph to graph_optimize (#16082) 2025-09-18 13:46:17 -05:00
hbm.cpp ggml-cpu : split arch-specific implementations (#13892) 2025-06-09 16:47:13 +02:00
hbm.h ggml-cpu : split arch-specific implementations (#13892) 2025-06-09 16:47:13 +02:00
ops.cpp ggml : implement set_rows with i32 index (#16159) 2025-09-22 19:13:00 +02:00
ops.h ggml: add ops for WAN video model (cuda && cpu) (#15669) 2025-09-04 10:38:49 +02:00
quants.c llama : add gpt-oss (#15091) 2025-08-05 22:10:36 +03:00
quants.h llama : add gpt-oss (#15091) 2025-08-05 22:10:36 +03:00
repack.cpp ggml : repack block_iq4_nlx8 (#14904) 2025-08-13 11:09:39 +03:00
repack.h ggml : repack block_iq4_nlx8 (#14904) 2025-08-13 11:09:39 +03:00
simd-mappings.h ggml : fix loongarch lsx compilation error (#15864) 2025-09-25 12:22:55 +03:00
traits.cpp ggml : fix fallback to CPU for ununsupported ops (#15118) 2025-08-06 14:37:35 +02:00
traits.h ggml : fix fallback to CPU for ununsupported ops (#15118) 2025-08-06 14:37:35 +02:00
unary-ops.cpp cpu: de-duplicate some of the operators and refactor (ggml/1144) 2025-03-30 08:33:31 +03:00
unary-ops.h cpu: de-duplicate some of the operators and refactor (ggml/1144) 2025-03-30 08:33:31 +03:00
vec.cpp ggml-cpu : optimize RVV kernels (#15720) 2025-09-03 16:16:21 +08:00
vec.h ggml-cpu : optimize RVV kernels (#15720) 2025-09-03 16:16:21 +08:00