llama.cpp/gguf-py/gguf/scripts
Aleksei Nikiforov 64387f6e95
gguf-py: byteswapping improvements (#12851)
* gguf-py: implement byteswapping for Q4_0

This is needed to byteswap Mistral model.

Also restore original shapes after byteswapping tensors.
It is not needed at the moment, but do it in case
they'd be used in future.

* Rework byteswapping code in gguf-py

Move out details from byteswapping tensor blocks code
2025-08-28 16:56:41 +08:00
..
gguf_convert_endian.py gguf-py: byteswapping improvements (#12851) 2025-08-28 16:56:41 +08:00
gguf_dump.py gguf-py : dump bpw per layer and model in markdown mode (#14703) 2025-07-16 00:04:42 +02:00
gguf_editor_gui.py gguf-py : add support for sub_type (in arrays) in GGUFWriter add_key_value method (#13561) 2025-05-29 15:36:05 +02:00
gguf_hash.py gguf-py: fixed local detection of gguf package (#11180) 2025-01-11 11:42:31 +02:00
gguf_new_metadata.py gguf-py : add --chat-template-file to gguf_new_metadata (#15075) 2025-08-04 21:01:48 +02:00
gguf_set_metadata.py gguf-py: fixed local detection of gguf package (#11180) 2025-01-11 11:42:31 +02:00