llama.cpp

History

Aleksei Nikiforov 64387f6e95 gguf-py: byteswapping improvements (#12851 ) * gguf-py: implement byteswapping for Q4_0 This is needed to byteswap Mistral model. Also restore original shapes after byteswapping tensors. It is not needed at the moment, but do it in case they'd be used in future. * Rework byteswapping code in gguf-py Move out details from byteswapping tensor blocks code		2025-08-28 16:56:41 +08:00
..
gguf_convert_endian.py	gguf-py: byteswapping improvements (#12851 )	2025-08-28 16:56:41 +08:00
gguf_dump.py	gguf-py : dump bpw per layer and model in markdown mode (#14703 )	2025-07-16 00:04:42 +02:00
gguf_editor_gui.py	gguf-py : add support for sub_type (in arrays) in GGUFWriter add_key_value method (#13561 )	2025-05-29 15:36:05 +02:00
gguf_hash.py	gguf-py: fixed local detection of gguf package (#11180 )	2025-01-11 11:42:31 +02:00
gguf_new_metadata.py	gguf-py : add --chat-template-file to gguf_new_metadata (#15075 )	2025-08-04 21:01:48 +02:00
gguf_set_metadata.py	gguf-py: fixed local detection of gguf package (#11180 )	2025-01-11 11:42:31 +02:00