llama.cpp/tools/mtmd/models
megemini 237958db33
model: Add PaddleOCR-VL model support (#18825)
* support PaddleOCR-VL

* clip: update PaddleOCR model loader parameters to prevent OOM during warmup

* [update] add paddleocr vl text model instead of ernie4.5

* [update] restore change of minicpmv

* [update] format

* [update] format

* [update] positions and patch merge permute

* [update] mtmd_decode_use_mrope for paddleocr

* [update] image min/max pixels

* [update] remove set_limit_image_tokens

* upate: preprocess without padding

* clean up

* Update convert_hf_to_gguf.py

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

* Update convert_hf_to_gguf.py

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

---------

Co-authored-by: Xuan Son Nguyen <son@huggingface.co>
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
2026-02-19 17:05:25 +01:00
..
cogvlm.cpp clip: move model cgraphs into their own files (#17965) 2025-12-12 21:14:48 +01:00
conformer.cpp mtmd : Fix ASR for LFM2.5-Audio-1.5B (#18876) 2026-01-16 11:23:08 +01:00
glm4v.cpp model: support GLM-OCR (#19677) 2026-02-18 17:51:40 +01:00
internvl.cpp clip: move model cgraphs into their own files (#17965) 2025-12-12 21:14:48 +01:00
kimik25.cpp model: Add Kimi-K2.5 support (#19170) 2026-02-11 16:47:30 +01:00
kimivl.cpp clip: move model cgraphs into their own files (#17965) 2025-12-12 21:14:48 +01:00
llama4.cpp clip: move model cgraphs into their own files (#17965) 2025-12-12 21:14:48 +01:00
llava.cpp clip: move model cgraphs into their own files (#17965) 2025-12-12 21:14:48 +01:00
minicpmv.cpp clip: move model cgraphs into their own files (#17965) 2025-12-12 21:14:48 +01:00
mobilenetv5.cpp mtmd: Add Gemma3n multimodal support with MobileNetV5 vision encoder (#18256) 2026-01-09 23:42:38 +01:00
models.h model: Add PaddleOCR-VL model support (#18825) 2026-02-19 17:05:25 +01:00
nemotron-v2-vl.cpp mtmd : Add Nemotron Nano 12B v2 VL support (#19547) 2026-02-14 14:07:00 +01:00
paddleocr.cpp model: Add PaddleOCR-VL model support (#18825) 2026-02-19 17:05:25 +01:00
pixtral.cpp clip: move model cgraphs into their own files (#17965) 2025-12-12 21:14:48 +01:00
qwen2vl.cpp clip: move model cgraphs into their own files (#17965) 2025-12-12 21:14:48 +01:00
qwen3vl.cpp models : support qwen3.5 series (#19468) 2026-02-10 18:00:26 +02:00
siglip.cpp model : mtmd : make input norm optional in LFM2-VL (#18594) 2026-01-04 18:50:02 +01:00
whisper-enc.cpp mtmd : Adding support for Nvidia Music Flamingo Model (#18470) 2025-12-31 12:13:23 +01:00
youtuvl.cpp model: support youtu-vl model (#18479) 2026-01-01 19:25:54 +01:00