llama.cpp/ggml
Piotr Wilkin 57e5c878ff Converge implementation with export-graph-ops 2026-04-13 15:29:49 +02:00
..
cmake ggml: backend-agnostic tensor parallelism (experimental) (#19378) 2026-04-09 16:42:19 +02:00
include Add missing op parameters to the profiler; add support for test-backend-ops to run performance tests with exactly the tensor shapes from the run 2026-04-13 15:29:49 +02:00
src Converge implementation with export-graph-ops 2026-04-13 15:29:49 +02:00
.gitignore
CMakeLists.txt ggml: backend-agnostic tensor parallelism (experimental) (#19378) 2026-04-09 16:42:19 +02:00