Commit Graph

522 Commits

Author SHA1 Message Date
ibrahim khadraoui 212edffd86
Update src/llama-arch.cpp
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
2025-07-08 13:23:37 +04:00
ibrahim khadraoui debf4e5dd5
Update src/llama-model.cpp
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
2025-07-08 13:23:19 +04:00
ibrahim khadraoui 40058c043f
Update src/llama-model.cpp
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
2025-07-08 13:23:10 +04:00
ibrahim khadraoui 7fe1794cc3
Update src/llama-hparams.cpp
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
2025-07-08 13:22:56 +04:00
Younes B d28c31a90c
Merge branch 'master' into add-fh1-rebased 2025-07-08 10:37:13 +02:00
Younes B 58e3866d02
Update src/llama-model.cpp
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
2025-07-08 12:30:55 +04:00
Xuan-Son Nguyen 8f22dc0a53
model : add hunyuan moe (#14425)
* model : add hunyuan moe

* tokenizer ok

* fix tensor name

* cgraph init

* chat template

* wip

* almost working

* skip embed, fix bos

* cleanup

* yarn scaling

* cleanup

* correct rope type

* failed token fix

* ntk alpha freq_base

* tokenization working

* cleanup and pr changes

* vocab_size sanity check

* ntk alpha generic

* Update convert_hf_to_gguf.py

* Apply suggestions from code review

* fix regression

* fix style

---------

Co-authored-by: kooshi <1934337+kooshi@users.noreply.github.com>
2025-07-08 11:24:06 +03:00
younesbelkada 097df0ed85 remove final_norm 2025-07-08 11:26:04 +04:00
younesbelkada adff470c8a more cleanups and fixed conversion 2025-07-08 11:19:38 +04:00
younesbelkada 823696bab1 remove unneeded attributes 2025-07-08 11:15:21 +04:00
younesbelkada 4bc9e0ca89 tensor not required 2025-07-08 10:56:34 +04:00
ibrahimkhadraoui d41f111462 Merge branch 'add-fh1-rebased' of https://github.com/tiiuae/llama.cpp-public into add-fh1-rebased 2025-07-08 10:48:07 +04:00
ibrahimkhadraoui f028a43a91 Merge branch 'add-fh1-rebased' of https://github.com/tiiuae/llama.cpp-public into add-fh1-rebased 2025-07-08 10:48:01 +04:00
younesbelkada a846d02327 remove todo 2025-07-08 10:44:59 +04:00
ibrahimkhadraoui 7846c67e5c minor cleanups 2025-07-08 10:42:15 +04:00
younesbelkada d473d42832 more cleanups 2025-07-08 10:39:12 +04:00
ibrahimkhadraoui e63ee4649e cleanup 2025-07-08 10:31:12 +04:00
ibrahimkhadraoui da8a338531 Merge branch 'add-fh1-rebased' of https://github.com/tiiuae/llama.cpp-public into add-fh1-rebased 2025-07-08 10:23:18 +04:00
ibrahimkhadraoui 67b2664290 cleaning unused hparams 2025-07-08 10:20:17 +04:00
younesbelkada 7d7da0b37e d_ssm -> d_inner; 2025-07-08 10:18:43 +04:00
Sigbjørn Skjæret e1a7059053
llama : fix incorrect minicpm3 v_states shape (#14571) 2025-07-07 23:35:35 +02:00
Sigbjørn Skjæret 12f55c302b
llama : remove ggml_cont where possible (#14568) 2025-07-07 21:35:08 +02:00
younesbelkada d2f46f18ac moe cleanuips 2025-07-07 17:36:22 +04:00
younesbelkada 68cb7845e9 more cleanups 2025-07-07 17:34:20 +04:00
Younes B fd203302aa
Update src/llama-model-loader.cpp 2025-07-07 17:29:50 +04:00
younesbelkada 084873c215 some cleanups 2025-07-07 17:28:08 +04:00
younesbelkada 632861e6c1 some cleanups 2025-07-07 17:27:34 +04:00
younesbelkada f74e266f04 fix comment 2025-07-07 17:23:47 +04:00
ibrahimkhadraoui 042e5ff90b cleaning debug quant 2025-07-07 17:21:54 +04:00
ibrahimkhadraoui 624699c53f cleaning debugging stuff 2025-07-07 17:20:24 +04:00
ibrahimkhadraoui 935d46fab0 changed ROPE_TYPE 2025-07-07 17:01:54 +04:00
ibrahimkhadraoui ae937f442c rm unused key 2025-07-07 16:57:36 +04:00
ibrahimkhadraoui 53446f7e42 rm unused MAMBA_CHUNK_SIZE 2025-07-07 15:29:56 +04:00
ibrahimkhadraoui 0ad3502839 rm extra space 2025-07-07 15:26:46 +04:00
younesbelkada a9f3a63dc1 injected mup 2025-07-07 15:00:25 +04:00
ibrahimkhadraoui b3bc1fb237 Merge branch 'add-fh1-rebased' of https://github.com/tiiuae/llama.cpp-public into add-fh1-rebased 2025-07-07 14:36:55 +04:00
ibrahimkhadraoui 286e1fa569 fix rope_theta 2025-07-07 14:36:51 +04:00
ibrahimkhadraoui 49d7420964 inp_out_ids moved outside of layers loop 2025-07-07 14:18:48 +04:00
ibrahimkhadraoui 8c50893820 added some cb functions for debugging puposes 2025-07-07 14:10:45 +04:00
Younes B 6c39e775dd
fix conversion and d_inner 2025-07-07 10:56:49 +02:00
ibrahimkhadraoui 7a25441e13 fixed multipliers 2025-07-04 17:41:03 +04:00
ibrahimkhadraoui 15138df48f small fix ffn_norm 2025-07-04 15:37:40 +04:00
younesbelkada 22de62cf56 fix 2025-07-04 15:02:14 +04:00
younesbelkada cce35498d5 pre-norm -> norm 2025-07-04 14:58:33 +04:00
younesbelkada 50eadc7b33 fixes 2025-07-04 14:47:31 +04:00
Georgi Gerganov 67d1ef23c6
batch : add optional for sequential equal split (#14511)
ggml-ci
2025-07-04 09:08:59 +03:00
Georgi Gerganov 7b50f7c025
graph : prepare for 4D mask (#14515)
ggml-ci
2025-07-04 09:05:36 +03:00
Georgi Gerganov c79184d2d1
batch : add n_used count (#14512)
ggml-ci
2025-07-04 09:04:59 +03:00
younesbelkada 14c37ec047 more cleaning on python code 2025-07-03 18:09:30 +04:00
younesbelkada fdd5cff4ba minor fix 2025-07-03 17:12:05 +04:00