llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git

History

Francis Couture-Harpin 946796fcec ggml-cuda : slight optimizations for TQ2_0 Co-authored-by: Johannes Gäßler <johannesg@5d6.de>		2025-01-11 21:06:41 -05:00
..
include	GGUF: C++ refactor, backend support, misc fixes (#11030 )	2025-01-07 18:01:58 +01:00
src	ggml-cuda : slight optimizations for TQ2_0	2025-01-11 21:06:41 -05:00
.gitignore	vulkan : cmake integration (#8119 )	2024-07-13 18:12:39 +02:00
CMakeLists.txt	GGUF: C++ refactor, backend support, misc fixes (#11030 )	2025-01-07 18:01:58 +01:00