llama.cpp/tools/mtmd/models
AesSedai e463bbdf65
model: Add Kimi-K2.5 support (#19170)
* Move dequant_model to after the text_config merge
Add new kimi-k2.5 keys to mtmd convert
Update V_MMPROJ tensor mapping for new mm_projector.proj keys
Update V_M_IMP_NORM for new mm_projector.pre_norm key

* Fix a couple of oversights

* Add image support for Kimi-K2.5

* Revert changes to KimiVLForConditionalGeneration

* Fix an assert crash

* Fix permute swapping w / h on accident

* Kimi-K2.5: Use merged QKV for vision

* Kimi-K2.5: pre-convert vision QK to use build_rope_2d

* Kimi-K2.5: support non-interleaved rope for vision

* Kimi-K2.5: fix min / max pixel

* Kimi-K2.5: remove v/o permutes, unnecessary

* Kimi-K2.5: update permute name to match

* Update convert_hf_to_gguf.py

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

* Kimi-K2.5: replace build_rope_2d ggml_cont with ggml_view_3d pointers

---------

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
2026-02-11 16:47:30 +01:00
..
cogvlm.cpp
conformer.cpp mtmd : Fix ASR for LFM2.5-Audio-1.5B (#18876) 2026-01-16 11:23:08 +01:00
glm4v.cpp model: support GLM4V vision encoder (#18042) 2025-12-16 11:25:26 +01:00
internvl.cpp
kimik25.cpp model: Add Kimi-K2.5 support (#19170) 2026-02-11 16:47:30 +01:00
kimivl.cpp
llama4.cpp
llava.cpp
minicpmv.cpp
mobilenetv5.cpp mtmd: Add Gemma3n multimodal support with MobileNetV5 vision encoder (#18256) 2026-01-09 23:42:38 +01:00
models.h model: Add Kimi-K2.5 support (#19170) 2026-02-11 16:47:30 +01:00
pixtral.cpp
qwen2vl.cpp
qwen3vl.cpp models : support qwen3.5 series (#19468) 2026-02-10 18:00:26 +02:00
siglip.cpp model : mtmd : make input norm optional in LFM2-VL (#18594) 2026-01-04 18:50:02 +01:00
whisper-enc.cpp mtmd : Adding support for Nvidia Music Flamingo Model (#18470) 2025-12-31 12:13:23 +01:00
youtuvl.cpp model: support youtu-vl model (#18479) 2026-01-01 19:25:54 +01:00