llama.cpp

History

Georgi Gerganov 6a2c6145a0 metal : extend mat-mat multiplication support (#16225 ) * metal : support mul_mm with src1->type == GGML_TYPE_F16 * metal : support mul_mm_id with src1->type == GGML_TYPE_F16 [no ci] * metal : mul_mm support ne00 % 32 != 0 * metal : support mul_mm_id with ne00 % 32 != 0 * cont : remove unnecessary unrolls * cont : simplify data loading * metal : optimize mul_mm when output bounds checks are not needed		2025-09-28 09:34:44 +03:00
..
cmake	ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (#15094 )	2025-08-07 13:45:41 +02:00
include	llama: print memory breakdown on exit (#15860 )	2025-09-24 16:53:48 +02:00
src	metal : extend mat-mat multiplication support (#16225 )	2025-09-28 09:34:44 +03:00
.gitignore	vulkan : cmake integration (#8119 )	2024-07-13 18:12:39 +02:00
CMakeLists.txt	common : use cpp-httplib as a cURL alternative for downloads (#16185 )	2025-09-26 14:12:19 +03:00