llama.cpp/ggml
Johannes Gäßler ff5ef82786
CUDA: skip compilation of superfluous FA kernels (#21768)
2026-04-11 18:52:11 +02:00
..
cmake ggml: backend-agnostic tensor parallelism (experimental) (#19378) 2026-04-09 16:42:19 +02:00
include ggml: backend-agnostic tensor parallelism (experimental) (#19378) 2026-04-09 16:42:19 +02:00
src CUDA: skip compilation of superfluous FA kernels (#21768) 2026-04-11 18:52:11 +02:00
.gitignore vulkan : cmake integration (#8119) 2024-07-13 18:12:39 +02:00
CMakeLists.txt ggml: backend-agnostic tensor parallelism (experimental) (#19378) 2026-04-09 16:42:19 +02:00