llama.cpp

Xuan-Son Nguyen 8f22dc0a53 model : add hunyuan moe (#14425)	2025-07-08 11:24:06 +03:00
* model : add hunyuan moe
* tokenizer ok
* fix tensor name
* cgraph init
* chat template
* wip
* almost working
* skip embed, fix bos
* cleanup
* yarn scaling
* cleanup
* correct rope type
* failed token fix
* ntk alpha freq_base
* tokenization working
* cleanup and pr changes
* vocab_size sanity check
* ntk alpha generic
* Update convert_hf_to_gguf.py
* Apply suggestions from code review
* fix regression
* fix style

Co-authored-by: kooshi <1934337+kooshi@users.noreply.github.com>

scripts	gguf-py : add support for sub_type (in arrays) in GGUFWriter add_key_value method (#13561)	2025-05-29 15:36:05 +02:00
__init__.py	…
constants.py	model : add hunyuan moe (#14425)	2025-07-08 11:24:06 +03:00
gguf.py	…
gguf_reader.py	gguf-py : display the invalid gguf type (#13687)	2025-05-21 16:33:54 +02:00
gguf_writer.py	convert : correct gemma 3n conversion (#14450)	2025-07-03 10:03:06 +02:00
lazy.py	gguf-py : support lazy tensor splitting (#12809)	2025-04-08 09:03:07 +02:00
metadata.py	convert : fix Norway problem when parsing YAML (#12114)	2025-02-28 17:44:46 +01:00
py.typed	…
quants.py	ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151)	2024-09-05 21:48:47 -04:00
tensor_mapping.py	model : add hunyuan moe (#14425)	2025-07-08 11:24:06 +03:00
utility.py	gguf-py : fix SafetensorRemote return on undefined size (< 0) (#13841)	2025-05-28 23:50:20 +02:00
vocab.py	gguf-py : add support for chat template jinja files (#14508)	2025-07-02 21:02:35 +02:00