Commit Graph

8 Commits

Author           SHA1        Message                                           Date
Johannes Gäßler  4dc3d10e80  Remove shfl and AllReduce from backend interface  2026-02-11 14:51:37 +01:00
Johannes Gäßler  8de41b5b40  NCCL support                                      2026-02-11 14:12:33 +01:00
Johannes Gäßler  c531444411  fix output pattern                                2026-02-11 14:12:33 +01:00
Johannes Gäßler  c925563499  re-use buffers + ggml contexts                    2026-02-11 14:12:33 +01:00
Johannes Gäßler  2ffa49decc  add support for 4/8 GPUs                          2026-02-11 14:12:33 +01:00
Johannes Gäßler  4b8aa26650  partial Vulkan fix                                2026-02-11 14:12:33 +01:00
Johannes Gäßler  ab69c58aaa  support for GPT-OSS, Qwen 3 MoE                   2026-02-11 14:12:33 +01:00
Johannes Gäßler  a0d9dd20ee  ggml: backend-agnostic tensor parallelism         2026-02-11 14:12:33 +01:00