ibrahim khadraoui
|
212edffd86
|
Update src/llama-arch.cpp
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
|
2025-07-08 13:23:37 +04:00 |
ibrahim khadraoui
|
debf4e5dd5
|
Update src/llama-model.cpp
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
|
2025-07-08 13:23:19 +04:00 |
ibrahim khadraoui
|
40058c043f
|
Update src/llama-model.cpp
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
|
2025-07-08 13:23:10 +04:00 |
ibrahim khadraoui
|
7fe1794cc3
|
Update src/llama-hparams.cpp
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
|
2025-07-08 13:22:56 +04:00 |
Younes B
|
d28c31a90c
|
Merge branch 'master' into add-fh1-rebased
|
2025-07-08 10:37:13 +02:00 |
Younes B
|
58e3866d02
|
Update src/llama-model.cpp
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
|
2025-07-08 12:30:55 +04:00 |
Xuan-Son Nguyen
|
8f22dc0a53
|
model : add hunyuan moe (#14425)
* model : add hunyuan moe
* tokenizer ok
* fix tensor name
* cgraph init
* chat template
* wip
* almost working
* skip embed, fix bos
* cleanup
* yarn scaling
* cleanup
* correct rope type
* failed token fix
* ntk alpha freq_base
* tokenization working
* cleanup and pr changes
* vocab_size sanity check
* ntk alpha generic
* Update convert_hf_to_gguf.py
* Apply suggestions from code review
* fix regression
* fix style
---------
Co-authored-by: kooshi <1934337+kooshi@users.noreply.github.com>
|
2025-07-08 11:24:06 +03:00 |
younesbelkada
|
097df0ed85
|
remove final_norm
|
2025-07-08 11:26:04 +04:00 |
younesbelkada
|
adff470c8a
|
more cleanups and fixed conversion
|
2025-07-08 11:19:38 +04:00 |
younesbelkada
|
823696bab1
|
remove unneeded attributes
|
2025-07-08 11:15:21 +04:00 |
younesbelkada
|
4bc9e0ca89
|
tensor not required
|
2025-07-08 10:56:34 +04:00 |
ibrahimkhadraoui
|
d41f111462
|
Merge branch 'add-fh1-rebased' of https://github.com/tiiuae/llama.cpp-public into add-fh1-rebased
|
2025-07-08 10:48:07 +04:00 |
ibrahimkhadraoui
|
f028a43a91
|
Merge branch 'add-fh1-rebased' of https://github.com/tiiuae/llama.cpp-public into add-fh1-rebased
|
2025-07-08 10:48:01 +04:00 |
younesbelkada
|
a846d02327
|
remove todo
|
2025-07-08 10:44:59 +04:00 |
ibrahimkhadraoui
|
7846c67e5c
|
minor cleanups
|
2025-07-08 10:42:15 +04:00 |
younesbelkada
|
d473d42832
|
more cleanups
|
2025-07-08 10:39:12 +04:00 |
ibrahimkhadraoui
|
e63ee4649e
|
cleanup
|
2025-07-08 10:31:12 +04:00 |
ibrahimkhadraoui
|
da8a338531
|
Merge branch 'add-fh1-rebased' of https://github.com/tiiuae/llama.cpp-public into add-fh1-rebased
|
2025-07-08 10:23:18 +04:00 |
ibrahimkhadraoui
|
67b2664290
|
cleaning unused hparams
|
2025-07-08 10:20:17 +04:00 |
younesbelkada
|
7d7da0b37e
|
d_ssm -> d_inner;
|
2025-07-08 10:18:43 +04:00 |
Sigbjørn Skjæret
|
e1a7059053
|
llama : fix incorrect minicpm3 v_states shape (#14571)
|
2025-07-07 23:35:35 +02:00 |
Sigbjørn Skjæret
|
12f55c302b
|
llama : remove ggml_cont where possible (#14568)
|
2025-07-07 21:35:08 +02:00 |
younesbelkada
|
d2f46f18ac
|
moe cleanuips
|
2025-07-07 17:36:22 +04:00 |
younesbelkada
|
68cb7845e9
|
more cleanups
|
2025-07-07 17:34:20 +04:00 |
Younes B
|
fd203302aa
|
Update src/llama-model-loader.cpp
|
2025-07-07 17:29:50 +04:00 |
younesbelkada
|
084873c215
|
some cleanups
|
2025-07-07 17:28:08 +04:00 |
younesbelkada
|
632861e6c1
|
some cleanups
|
2025-07-07 17:27:34 +04:00 |
younesbelkada
|
f74e266f04
|
fix comment
|
2025-07-07 17:23:47 +04:00 |
ibrahimkhadraoui
|
042e5ff90b
|
cleaning debug quant
|
2025-07-07 17:21:54 +04:00 |
ibrahimkhadraoui
|
624699c53f
|
cleaning debugging stuff
|
2025-07-07 17:20:24 +04:00 |
ibrahimkhadraoui
|
935d46fab0
|
changed ROPE_TYPE
|
2025-07-07 17:01:54 +04:00 |
ibrahimkhadraoui
|
ae937f442c
|
rm unused key
|
2025-07-07 16:57:36 +04:00 |
ibrahimkhadraoui
|
53446f7e42
|
rm unused MAMBA_CHUNK_SIZE
|
2025-07-07 15:29:56 +04:00 |
ibrahimkhadraoui
|
0ad3502839
|
rm extra space
|
2025-07-07 15:26:46 +04:00 |
younesbelkada
|
a9f3a63dc1
|
injected mup
|
2025-07-07 15:00:25 +04:00 |
ibrahimkhadraoui
|
b3bc1fb237
|
Merge branch 'add-fh1-rebased' of https://github.com/tiiuae/llama.cpp-public into add-fh1-rebased
|
2025-07-07 14:36:55 +04:00 |
ibrahimkhadraoui
|
286e1fa569
|
fix rope_theta
|
2025-07-07 14:36:51 +04:00 |
ibrahimkhadraoui
|
49d7420964
|
inp_out_ids moved outside of layers loop
|
2025-07-07 14:18:48 +04:00 |
ibrahimkhadraoui
|
8c50893820
|
added some cb functions for debugging puposes
|
2025-07-07 14:10:45 +04:00 |
Younes B
|
6c39e775dd
|
fix conversion and d_inner
|
2025-07-07 10:56:49 +02:00 |
ibrahimkhadraoui
|
7a25441e13
|
fixed multipliers
|
2025-07-04 17:41:03 +04:00 |
ibrahimkhadraoui
|
15138df48f
|
small fix ffn_norm
|
2025-07-04 15:37:40 +04:00 |
younesbelkada
|
22de62cf56
|
fix
|
2025-07-04 15:02:14 +04:00 |
younesbelkada
|
cce35498d5
|
pre-norm -> norm
|
2025-07-04 14:58:33 +04:00 |
younesbelkada
|
50eadc7b33
|
fixes
|
2025-07-04 14:47:31 +04:00 |
Georgi Gerganov
|
67d1ef23c6
|
batch : add optional for sequential equal split (#14511)
ggml-ci
|
2025-07-04 09:08:59 +03:00 |
Georgi Gerganov
|
7b50f7c025
|
graph : prepare for 4D mask (#14515)
ggml-ci
|
2025-07-04 09:05:36 +03:00 |
Georgi Gerganov
|
c79184d2d1
|
batch : add n_used count (#14512)
ggml-ci
|
2025-07-04 09:04:59 +03:00 |
younesbelkada
|
14c37ec047
|
more cleaning on python code
|
2025-07-03 18:09:30 +04:00 |
younesbelkada
|
fdd5cff4ba
|
minor fix
|
2025-07-03 17:12:05 +04:00 |