Johannes Gäßler
|
4dc3d10e80
|
Remove shfl and AllReduce from backend interface
|
2026-02-11 14:51:37 +01:00 |
Johannes Gäßler
|
8de41b5b40
|
NCCL support
|
2026-02-11 14:12:33 +01:00 |
Johannes Gäßler
|
c531444411
|
fix output pattern
|
2026-02-11 14:12:33 +01:00 |
Johannes Gäßler
|
c925563499
|
re-use buffers + ggml contexts
|
2026-02-11 14:12:33 +01:00 |
Johannes Gäßler
|
2ffa49decc
|
add support for 4/8 GPUs
|
2026-02-11 14:12:33 +01:00 |
Johannes Gäßler
|
4b8aa26650
|
partial Vulkan fix
|
2026-02-11 14:12:33 +01:00 |
Johannes Gäßler
|
ab69c58aaa
|
support for GPT-OSS, Qwen 3 MoE
|
2026-02-11 14:12:33 +01:00 |
Johannes Gäßler
|
a0d9dd20ee
|
ggml: backend-agnostic tensor parallelism
|
2026-02-11 14:12:33 +01:00 |