llama.cpp/tools/mtmd/models
Tarek Dakhran c945aaaef2
mtmd : Fix ASR for LFM2.5-Audio-1.5B (#18876)
2026-01-16 11:23:08 +01:00
..
cogvlm.cpp
conformer.cpp mtmd : Fix ASR for LFM2.5-Audio-1.5B (#18876) 2026-01-16 11:23:08 +01:00
glm4v.cpp model: support GLM4V vision encoder (#18042) 2025-12-16 11:25:26 +01:00
internvl.cpp
kimivl.cpp
llama4.cpp
llava.cpp
minicpmv.cpp
mobilenetv5.cpp mtmd: Add Gemma3n multimodal support with MobileNetV5 vision encoder (#18256) 2026-01-09 23:42:38 +01:00
models.h mtmd: Add Gemma3n multimodal support with MobileNetV5 vision encoder (#18256) 2026-01-09 23:42:38 +01:00
pixtral.cpp
qwen2vl.cpp
qwen3vl.cpp
siglip.cpp model : mtmd : make input norm optional in LFM2-VL (#18594) 2026-01-04 18:50:02 +01:00
whisper-enc.cpp mtmd : Adding support for Nvidia Music Flamingo Model (#18470) 2025-12-31 12:13:23 +01:00
youtuvl.cpp model: support youtu-vl model (#18479) 2026-01-01 19:25:54 +01:00