Saba Fallah
d70f171fac
merge with changes from https://github.com/ggml-org/llama.cpp/pull/17909
...
added new opt to tests.sh to disable flash-attn
2025-12-11 10:11:27 +01:00
Saba Fallah
33fabf0bd8
Merge branch 'master' into sf/deepseek-ocr-merge-test
...
# Conflicts:
# tools/mtmd/clip.cpp
# tools/mtmd/mtmd-cli.cpp
2025-12-11 08:13:50 +01:00
Saba Fallah
aaf2fd17bb
minor: editorconfig-check fix
2025-12-11 07:31:08 +01:00
Xuan-Son Nguyen
c6b2c9310c
mtmd: some small clean up (#17909)
...
* clip: add support for fused qkv in build_vit
* use build_ffn whenever possible
* fix internvl
* mtmd-cli: move image to beginning
* test script: support custom args
2025-12-10 22:20:06 +01:00
Xuan-Son Nguyen
34a6d86982
cli: enable jinja by default (#17911)
...
* cli: enable jinja by default
* Update common/arg.cpp
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
---------
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
2025-12-10 22:19:42 +01:00
Saba Fallah
ed944cd25b
fix: test-1.jpg OCR issue with small (640) resolution
...
set min resolution to base (1024) and max resolution to large (1280) for dynamic resolution
2025-12-10 20:20:55 +01:00
Georgi Gerganov
4dff236a52
ggml : remove GGML_KQ_MASK_PAD constant (#17910)
...
* ggml : remove GGML_KQ_MASK_PAD constant
* cont : remove comment
2025-12-10 20:53:16 +02:00
Xuan-Son Nguyen
6c2131773c
cli: new CLI experience (#17824)
...
* wip
* wip
* fix logging, add display info
* handle commands
* add args
* wip
* move old cli to llama-completion
* rm deprecation notice
* move server to a shared library
* move ci to llama-completion
* add loading animation
* add --show-timings arg
* add /read command, improve LOG_ERR
* add args for speculative decoding, enable show timings by default
* add arg --image and --audio
* fix windows build
* support reasoning_content
* fix llama2c workflow
* color default is auto
* fix merge conflicts
* properly fix color problem
Co-authored-by: bandoti <bandoti@users.noreply.github.com>
* better loading spinner
* make sure to clean color on force-exit
* also clear input files on "/clear"
* simplify common_log_flush
* add warning in mtmd-cli
* implement console writer
* fix data race
* add attribute
* fix llama-completion and mtmd-cli
* add some notes about console::log
* fix compilation
---------
Co-authored-by: bandoti <bandoti@users.noreply.github.com>
2025-12-10 15:28:59 +01:00
bluebread
016140699f
mtmd: remove tweak to llama-mtmd-cli & deepseek-ocr template
2025-12-09 16:31:44 +00:00
Rhys-T
63908b631a
cmake: fix Mach-O current version number (#17877)
...
PR #17091 set the VERSION of various libraries to 0.0.abcd, where abcd
is the LLAMA_BUILD_NUMBER. That build number is too large to fit in the
Mach-O 'current version' field's 'micro' part, which only goes up to
255. This just sets the Mach-O current version to 0 to get it building
properly again.
Fixes #17258.
2025-12-09 13:17:41 +02:00
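The 255 limit mentioned in the Mach-O fix above (63908b631a) follows from how the Mach-O current-version field is packed: a 32-bit value with 16 bits for the major part and 8 bits each for the minor and micro parts. A minimal Python sketch of that packing, for illustration only; `pack_macho_version` and the build number 7000 are hypothetical and are not taken from the llama.cpp CMake code:

```python
def pack_macho_version(major: int, minor: int, micro: int) -> int:
    """Pack X.Y.Z into the 32-bit Mach-O layout: 16 bits . 8 bits . 8 bits."""
    if not 0 <= major <= 0xFFFF:
        raise ValueError(f"major component {major} does not fit in 16 bits")
    for name, part in (("minor", minor), ("micro", micro)):
        if not 0 <= part <= 0xFF:
            raise ValueError(f"{name} component {part} does not fit in 8 bits (max 255)")
    return (major << 16) | (minor << 8) | micro


# Version 0.0.0 (what the fix falls back to) packs without trouble.
zero_version = pack_macho_version(0, 0, 0)

# A four-digit build number (7000 is a made-up example) cannot be
# stored in the micro part, so packing 0.0.7000 is rejected.
try:
    pack_macho_version(0, 0, 7000)
except ValueError as err:
    overflow_message = str(err)
```

Under this layout any 0.0.abcd version with abcd above 255 overflows the micro part, which is consistent with the commit's choice to set the Mach-O current version to 0.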
bluebread
5174a1e69a
mtmd: minor fix
2025-12-08 04:54:19 +00:00
bluebread
48c6cf2132
mtmd: convert model in FP16
2025-12-08 02:36:00 +00:00
bluebread
53273f83f8
mtmd: fixed wrong input setting
2025-12-07 23:58:22 +00:00
bluebread
5dfcc5abb1
mtmd: add detailed comments for resize_bicubic_pillow
2025-12-07 10:15:09 +00:00
bluebread
2d918b3e21
mtmd: make sam hparams configurable
2025-12-06 06:55:53 +00:00
bluebread
15f2ada0ed
mtmd: simplify get_rel_pos
2025-12-06 06:32:41 +00:00
Saba Fallah
705394c27a
minor editorconfig-check fixes
2025-12-05 13:27:52 +01:00
Saba Fallah
d981f19e9d
minor editorconfig-check fixes
2025-12-05 13:18:15 +01:00
Saba Fallah
5f2ee1aecf
Merge branch 'ggml-org:master' into sf/deepseek-ocr
2025-12-05 11:56:06 +01:00
Saba Fallah
f5bd310a5e
minor formatting and style
2025-12-05 09:30:58 +01:00
Saba Fallah
076138a428
corrected code-branch when flash-attn disabled
...
enabling usage of --flash-attn option
2025-12-04 23:45:59 +01:00
Saba Fallah
5381b9cf63
using common build_attn in sam
2025-12-04 23:13:29 +01:00
bluebread
fc3f625fef
mtmd: support combined QKV projection in build_vit
2025-12-04 17:57:43 +00:00
Saba Fallah
a661c52990
reverting automatically removed spaces
2025-12-04 16:12:41 +01:00
Saba Fallah
c73748ab5d
Merge branch 'sf/deepseek-ocr' into sf/deepseek-ocr-cleanup
...
# Conflicts:
# gguf-py/gguf/tensor_mapping.py
2025-12-04 15:09:32 +01:00
Saba Fallah
386ba479a2
clean up
2025-12-04 15:05:58 +01:00
bluebread
7451b84105
mtmd: fix tensor names for image newlines and view separator
2025-12-04 13:26:53 +00:00
bluebread
b26b507c4e
mtmd: refactor code & remove unused helper functions
2025-12-03 16:23:46 +00:00
bluebread
b696c54756
mtmd: remove --dsocr-mode argument
2025-12-03 14:54:16 +00:00
bluebread
43dfc0c8d6
Merge branch 'sf/deepseek-ocr' of github.com:sfallah/llama.cpp into sf/deepseek-ocr
2025-12-03 07:52:26 +00:00
bluebread
e20857ba59
mtmd: simplify DeepSeek-OCR dynamic resolution preprocessing
2025-12-03 07:51:12 +00:00
bluebread
c914e05405
mtmd: adapt Pillow image resizing function
2025-12-03 05:18:39 +00:00
Xuan-Son Nguyen
a96283adc4
mtmd: fix --no-warmup (#17695)
2025-12-02 22:48:08 +01:00
Saba Fallah
66341666fb
Merge branch 'master' into sf/deepseek-ocr
...
# Conflicts:
# convert_hf_to_gguf.py
# tools/mtmd/clip.h
# tools/mtmd/mtmd.cpp
2025-12-02 21:02:13 +01:00
Xuan-Son Nguyen
ecf74a8417
mtmd: add mtmd_context_params::warmup option (#17652)
...
* mtmd: add mtmd_context_params::warmup option
* reuse the common_params::warmup
2025-12-01 21:32:25 +01:00
bluebread
95239f92b9
mtmd: simplify SAM patch embedding
2025-12-01 07:31:24 +00:00
Tarek Dakhran
2ba719519d
model: LFM2-VL fixes (#17577)
...
* Adjust to pytorch
* Add antialiasing upscale
* Increase number of patches to 1024
* Handle default marker insertion for LFM2
* Switch to flag
* Reformat
* Cuda implementation of antialias kernel
* Change placement in ops.cpp
* consistent float literals
* Pad only for LFM2
* Address PR feedback
* Rollback default marker placement changes
* Fallback to CPU implementation for antialias implementation of upscale
2025-11-30 21:57:31 +01:00
bluebread
c5f4c64fe4
mtmd : add --dsocr-mode CLI argument for DeepSeek-OCR resolution control & all native resolution modes work
2025-11-30 16:57:19 +00:00
Xuan-Son Nguyen
7f8ef50cce
clip: fix nb calculation for qwen3-vl (#17594)
2025-11-30 15:33:55 +01:00
bluebread
55430945ef
Merge branch 'sf/deepseek-ocr' of github.com:sfallah/llama.cpp into sf/deepseek-ocr
2025-11-30 08:55:29 +00:00
Saba Fallah
ed3b7f1056
Merge remote-tracking branch 'sfallah/master' into sf/deepseek-ocr
...
# Conflicts:
# convert_hf_to_gguf.py
# src/llama-model.cpp
# src/models/deepseek2.cpp
2025-11-30 08:29:09 +01:00
bluebread
841a4a88df
mtmd: debug CLIP-L & first working DeepSeek-OCR model
2025-11-29 16:40:50 +00:00
bluebread
ccb2f2385e
mtmd: debug CLIP-L (vit_pre_ln)
2025-11-29 07:04:14 +00:00
bluebread
a488b495f7
mtmd: SAM numerically works
2025-11-29 02:17:49 +00:00
Han Qingzhe
1d594c295c
clip: (minicpmv) fix resampler kq_scale (#17516)
...
* debug:"solve minicpmv precision problem"
* “debug minicpmv”
* Apply suggestion from @ngxson
---------
Co-authored-by: Xuan-Son Nguyen <thichthat@gmail.com>
2025-11-26 21:44:07 +01:00
bluebread
81533e494e
mtmd: fix dangling pointer
2025-11-24 09:02:03 +00:00
bluebread
40e7e6e706
mtmd: quick fix token order
2025-11-24 08:16:32 +00:00
Saba Fallah
206f8abc3c
- dynamic resizing
...
- changes are concerning PR https://github.com/sfallah/llama.cpp/pull/4
2025-11-23 20:27:02 +01:00
Saba Fallah
6dfda99c69
Merge branch 'sf/deepseek-ocr' into sf/deepseek-ocr
2025-11-23 12:29:37 +01:00
bluebread
3f71188303
mtmd: correct token order
2025-11-23 09:22:00 +00:00