llama.cpp

History

Johannes Gäßler 75a3a6c2cd CUDA: refactor and deduplicate vector FA kernels (#16208 ) * CUDA: refactor and deduplicate vector FA kernels		2025-09-27 18:45:07 +02:00
..
cmake	ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (#15094 )	2025-08-07 13:45:41 +02:00
include	llama: print memory breakdown on exit (#15860 )	2025-09-24 16:53:48 +02:00
src	CUDA: refactor and deduplicate vector FA kernels (#16208 )	2025-09-27 18:45:07 +02:00
.gitignore	vulkan : cmake integration (#8119 )	2024-07-13 18:12:39 +02:00
CMakeLists.txt	common : use cpp-httplib as a cURL alternative for downloads (#16185 )	2025-09-26 14:12:19 +03:00