llama.cpp/ggml
Johannes Gäßler 1d72c84188
CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16 (#15131)
2025-08-07 10:53:21 +02:00
cmake           cmake: Add GGML_BACKEND_DIR option (#15074)            2025-08-04 21:29:14 +02:00
include         llama : add gpt-oss (#15091)                           2025-08-05 22:10:36 +03:00
src             CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16 (#15131)  2025-08-07 10:53:21 +02:00
.gitignore      vulkan : cmake integration (#8119)                     2024-07-13 18:12:39 +02:00
CMakeLists.txt  cmake: Add GGML_BACKEND_DIR option (#15074)            2025-08-04 21:29:14 +02:00