llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git

Francis Couture-Harpin 71bef66591 cuda : graceful fallback for Mamba-1 models with weird embd size		2025-07-02 03:49:36 -04:00
cmake	ggml-cpu : rework weak alias on apple targets (#14146)	2025-06-16 13:54:15 +08:00
include	Merge branch 'master' into compilade/mamba2	2025-07-02 02:39:04 -04:00
src	cuda : graceful fallback for Mamba-1 models with weird embd size	2025-07-02 03:49:36 -04:00
.gitignore	vulkan : cmake integration (#8119)	2024-07-13 18:12:39 +02:00
CMakeLists.txt	ggml-cpu: enable IBM NNPA Vector Intrinsics (#14317)	2025-06-25 23:49:04 +02:00