llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git

Johannes Gäßler 1d72c84188 CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16 (#15131)	2025-08-07 10:53:21 +02:00
cmake	cmake: Add GGML_BACKEND_DIR option (#15074)	2025-08-04 21:29:14 +02:00
include	llama : add gpt-oss (#15091)	2025-08-05 22:10:36 +03:00
src	CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16 (#15131)	2025-08-07 10:53:21 +02:00
.gitignore	vulkan : cmake integration (#8119)	2024-07-13 18:12:39 +02:00
CMakeLists.txt	cmake: Add GGML_BACKEND_DIR option (#15074)	2025-08-04 21:29:14 +02:00