llama.cpp/ggml
Francis Couture-Harpin 71bef66591 cuda : graceful fallback for Mamba-1 models with weird embd size 2025-07-02 03:49:36 -04:00
cmake           ggml-cpu : rework weak alias on apple targets (#14146)            2025-06-16 13:54:15 +08:00
include         Merge branch 'master' into compilade/mamba2                       2025-07-02 02:39:04 -04:00
src             cuda : graceful fallback for Mamba-1 models with weird embd size  2025-07-02 03:49:36 -04:00
.gitignore      vulkan : cmake integration (#8119)                                2024-07-13 18:12:39 +02:00
CMakeLists.txt  ggml-cpu: enable IBM NNPA Vector Intrinsics (#14317)              2025-06-25 23:49:04 +02:00