Michael Grau
6729d4920c
model : add control vector support where missing ( #20653 )
...
* Add control vector functions to qwen3.5 and qwen-next models
* Add missing cvec compatibility to the rest of the models
* Adjust comments and formatting
* cleanup
* whitespace
---------
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
2026-03-18 23:25:12 +01:00
Xuan-Son Nguyen
59db9a357d
llama: dynamic head_dim and n_rot for SWA ( #20301 )
...
* llama: dynamic head_dim and n_rot for SWA
* also add gguf_writer wrappers
* fix build
* build_rope_shift arg reorder
2026-03-09 22:22:39 +01:00
Sigbjørn Skjæret
35bee031e1
graph : remove redundant scale_w parameter ( #20235 )
2026-03-08 18:58:28 +01:00
forforever73
b83111815e
model : support Step3.5-Flash ( #19283 )
...
* Support Step3.5-Flash
* fix: norm.weight + 1 (HF zero_centered=true)
* step35: simplify GGUF conversion + drop redundant rope KVs
* Address review feedback
* rename limits -> clamp
* Apply suggestions from code review
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
* Apply suggestion from @CISC
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
* rename swiglu limits -> swiglu clamp in LLM_KV
* avoid CI fail
* Apply suggestions from code review
* Apply suggestions from code review
* disabled KV shifting for LLM_ARCH_STEP35
* Apply suggestions from code review
* mistakenly removed cmath
* add model size && apply missed suggestion
* assert partial_rotary_factors
* fix CI errors:
* load freq_base_swa
---------
Co-authored-by: lvyichen <lvyichen@stepfun.com>
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
2026-02-06 21:06:14 +01:00