llama.cpp

Commit Graph

Author	SHA1	Message	Date
Saba Fallah	d70f171fac	merge with changes from https://github.com/ggml-org/llama.cpp/pull/17909 added new opt to tests.sh to disable flash-attn	2025-12-11 10:11:27 +01:00
Saba Fallah	33fabf0bd8	Merge branch 'master' into sf/deepseek-ocr-merge-test # Conflicts: # tools/mtmd/clip.cpp # tools/mtmd/mtmd-cli.cpp	2025-12-11 08:13:50 +01:00
Saba Fallah	aaf2fd17bb	minor: editconfig-check fix	2025-12-11 07:31:08 +01:00
Xuan-Son Nguyen	c6b2c9310c	mtmd: some small clean up (#17909) * clip: add support for fused qkv in build_vit * use bulid_ffn whenever possible * fix internvl * mtmd-cli: move image to beginning * test script: support custom args	2025-12-10 22:20:06 +01:00
Xuan-Son Nguyen	34a6d86982	cli: enable jinja by default (#17911) * cli: enable jinja by default * Update common/arg.cpp Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com> --------- Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>	2025-12-10 22:19:42 +01:00
Saba Fallah	ed944cd25b	fix: test-1.jpg ORC issue with small (640) resolution setting min-resolution base (1024) max large (1280) for dynamic-resolution	2025-12-10 20:20:55 +01:00
Georgi Gerganov	4dff236a52	ggml : remove GGML_KQ_MASK_PAD constant (#17910) * ggml : remove GGML_KQ_MASK_PAD constant * cont : remove comment	2025-12-10 20:53:16 +02:00
Xuan-Son Nguyen	6c2131773c	cli: new CLI experience (#17824) * wip * wip * fix logging, add display info * handle commands * add args * wip * move old cli to llama-completion * rm deprecation notice * move server to a shared library * move ci to llama-completion * add loading animation * add --show-timings arg * add /read command, improve LOG_ERR * add args for speculative decoding, enable show timings by default * add arg --image and --audio * fix windows build * support reasoning_content * fix llama2c workflow * color default is auto * fix merge conflicts * properly fix color problem Co-authored-by: bandoti <bandoti@users.noreply.github.com> * better loading spinner * make sure to clean color on force-exit * also clear input files on "/clear" * simplify common_log_flush * add warning in mtmd-cli * implement console writter * fix data race * add attribute * fix llama-completion and mtmd-cli * add some notes about console::log * fix compilation --------- Co-authored-by: bandoti <bandoti@users.noreply.github.com>	2025-12-10 15:28:59 +01:00
bluebread	016140699f	mtmd: remove tweak to llama-mtmd-cli & deepseek-ocr template	2025-12-09 16:31:44 +00:00
Rhys-T	63908b631a	cmake: fix Mach-O current version number (#17877) PR #17091 set the VERSION of various libraries to 0.0.abcd, where abcd is the LLAMA_BUILD_NUMBER. That build number is too large to fit in the Mach-O 'current version' field's 'micro' part, which only goes up to 255. This just sets the Mach-O current version to 0 to get it building properly again. Fixes #17258.	2025-12-09 13:17:41 +02:00
bluebread	5174a1e69a	mtmd: minor fix	2025-12-08 04:54:19 +00:00
bluebread	48c6cf2132	mtmd: convert model in FP16	2025-12-08 02:36:00 +00:00
bluebread	53273f83f8	mtmd: fixed wrong input setting	2025-12-07 23:58:22 +00:00
bluebread	5dfcc5abb1	mtmd: add detailed comments for resize_bicubic_pillow	2025-12-07 10:15:09 +00:00
bluebread	2d918b3e21	mtmd: make sam hparams configurable	2025-12-06 06:55:53 +00:00
bluebread	15f2ada0ed	mtmd: simplify get_rel_pos	2025-12-06 06:32:41 +00:00
Saba Fallah	705394c27a	minor editorconfig-check fixes	2025-12-05 13:27:52 +01:00
Saba Fallah	d981f19e9d	minor editorconfig-check fixes	2025-12-05 13:18:15 +01:00
Saba Fallah	5f2ee1aecf	Merge branch 'ggml-org:master' into sf/deepseek-ocr	2025-12-05 11:56:06 +01:00
Saba Fallah	f5bd310a5e	minor formatting and style	2025-12-05 09:30:58 +01:00
Saba Fallah	076138a428	corrected code-branch when flash-attn disabled enabling usage of --flash-attn option	2025-12-04 23:45:59 +01:00
Saba Fallah	5381b9cf63	using common build_attn in sam	2025-12-04 23:13:29 +01:00
bluebread	fc3f625fef	mtmd: support combined QKV projection in buid_vit	2025-12-04 17:57:43 +00:00
Saba Fallah	a661c52990	reverting automatically removed spaces	2025-12-04 16:12:41 +01:00
Saba Fallah	c73748ab5d	Merge branch 'sf/deepseek-ocr' into sf/deepseek-ocr-cleanup # Conflicts: # gguf-py/gguf/tensor_mapping.py	2025-12-04 15:09:32 +01:00
Saba Fallah	386ba479a2	clean up	2025-12-04 15:05:58 +01:00
bluebread	7451b84105	mtmd: fix tensor names for image newlines and view separator	2025-12-04 13:26:53 +00:00
bluebread	b26b507c4e	mtmd: refactor code & remove unused helper functions	2025-12-03 16:23:46 +00:00
bluebread	b696c54756	mtmd: remove --dsocr-mode argument	2025-12-03 14:54:16 +00:00
bluebread	43dfc0c8d6	Merge branch 'sf/deepseek-ocr' of github.com:sfallah/llama.cpp into sf/deepseek-ocr	2025-12-03 07:52:26 +00:00
bluebread	e20857ba59	mtmd: simplify DeepSeek-OCR dynamic resolution preprocessing	2025-12-03 07:51:12 +00:00
bluebread	c914e05405	mtmd: adapt Pillow image resizing function	2025-12-03 05:18:39 +00:00
Xuan-Son Nguyen	a96283adc4	mtmd: fix --no-warmup (#17695)	2025-12-02 22:48:08 +01:00
Saba Fallah	66341666fb	Merge branch 'master' into sf/deepseek-ocr # Conflicts: # convert_hf_to_gguf.py # tools/mtmd/clip.h # tools/mtmd/mtmd.cpp	2025-12-02 21:02:13 +01:00
Xuan-Son Nguyen	ecf74a8417	mtmd: add mtmd_context_params::warmup option (#17652) * mtmd: add mtmd_context_params::warmup option * reuse the common_params::warmup	2025-12-01 21:32:25 +01:00
bluebread	95239f92b9	mtmd: simplify SAM patch embedding	2025-12-01 07:31:24 +00:00
Tarek Dakhran	2ba719519d	model: LFM2-VL fixes (#17577) * Adjust to pytorch * Add antialiasing upscale * Increase number of patches to 1024 * Handle default marker insertion for LFM2 * Switch to flag * Reformat * Cuda implementation of antialias kernel * Change placement in ops.cpp * consistent float literals * Pad only for LFM2 * Address PR feedback * Rollback default marker placement changes * Fallback to CPU implementation for antialias implementation of upscale	2025-11-30 21:57:31 +01:00
bluebread	c5f4c64fe4	mtmd : add --dsocr-mode CLI argument for DeepSeek-OCR resolution control & all native resolution modes work	2025-11-30 16:57:19 +00:00
Xuan-Son Nguyen	7f8ef50cce	clip: fix nb calculation for qwen3-vl (#17594)	2025-11-30 15:33:55 +01:00
bluebread	55430945ef	Merge branch 'sf/deepseek-ocr' of github.com:sfallah/llama.cpp into sf/deepseek-ocr	2025-11-30 08:55:29 +00:00
Saba Fallah	ed3b7f1056	Merge remote-tracking branch 'sfallah/master' into sf/deepseek-ocr # Conflicts: # convert_hf_to_gguf.py # src/llama-model.cpp # src/models/deepseek2.cpp	2025-11-30 08:29:09 +01:00
bluebread	841a4a88df	mtmd: debug CLIP-L & first working DeepSeek-OCR model	2025-11-29 16:40:50 +00:00
bluebread	ccb2f2385e	mtmd: debug CLIP-L (vit_pre_ln)	2025-11-29 07:04:14 +00:00
bluebread	a488b495f7	mtmd: SAM numerically works	2025-11-29 02:17:49 +00:00
Han Qingzhe	1d594c295c	clip: (minicpmv) fix resampler kq_scale (#17516) * debug:"solve minicpmv precision problem" * “debug minicpmv” * Apply suggestion from @ngxson --------- Co-authored-by: Xuan-Son Nguyen <thichthat@gmail.com>	2025-11-26 21:44:07 +01:00
bluebread	81533e494e	mtmd: fix danling pointer	2025-11-24 09:02:03 +00:00
bluebread	40e7e6e706	mtmd: quick fix token order	2025-11-24 08:16:32 +00:00
Saba Fallah	206f8abc3c	- dynamic resizing - changes are concerning PR https://github.com/sfallah/llama.cpp/pull/4	2025-11-23 20:27:02 +01:00
Saba Fallah	6dfda99c69	Merge branch 'sf/deepseek-ocr' into sf/deepseek-ocr	2025-11-23 12:29:37 +01:00
bluebread	3f71188303	mtmd: correct token order	2025-11-23 09:22:00 +00:00

146 Commits