llama.cpp/tools/gguf-split
CMakeLists.txt cmake : Do not install tools on iOS targets (#15903) 2025-09-16 09:54:44 +07:00
README.md llama : move end-user examples to tools directory (#13249) 2025-05-02 20:27:13 +02:00
gguf-split.cpp llama : move end-user examples to tools directory (#13249) 2025-05-02 20:27:13 +02:00
tests.sh ci : use smaller model (#16168) 2025-09-22 09:11:39 +03:00

README.md

GGUF split Example

CLI to split / merge GGUF files.

Command line options:

  • --split: split a GGUF file into multiple GGUF files (default operation).
  • --split-max-size: maximum size of each split, given in M or G, e.g. 500M or 2G.
  • --split-max-tensors: maximum number of tensors in each split (default: 128).
  • --merge: merge multiple GGUF files into a single GGUF file.
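
The two split limits above can be illustrated with a minimal C++ sketch. It is not the code from gguf-split.cpp: `parse_split_size` and `count_splits_by_tensors` are hypothetical helper names, and the sketch assumes that M and G are decimal units (10^6 and 10^9 bytes), which this README does not state. Overflow for very large values is not handled.

```cpp
#include <cstddef>
#include <cstdint>
#include <optional>
#include <string>

// Hypothetical helper: convert a size such as "500M" or "2G" to bytes.
// Assumption: M and G are decimal units (10^6 and 10^9 bytes).
// Returns std::nullopt for anything that is not <digits><M|G>.
std::optional<std::uint64_t> parse_split_size(const std::string & text) {
    if (text.size() < 2) {
        return std::nullopt; // need at least one digit plus a unit
    }

    std::uint64_t scale = 0;
    switch (text.back()) {
        case 'M': scale = 1000ULL * 1000ULL; break;
        case 'G': scale = 1000ULL * 1000ULL * 1000ULL; break;
        default:  return std::nullopt; // unknown unit
    }

    std::uint64_t value = 0;
    for (std::size_t i = 0; i + 1 < text.size(); ++i) {
        const char c = text[i];
        if (c < '0' || c > '9') {
            return std::nullopt; // only plain digits before the unit
        }
        value = value * 10 + static_cast<std::uint64_t>(c - '0');
    }
    if (value == 0) {
        return std::nullopt; // a zero-byte limit is not meaningful
    }
    return value * scale;
}

// Hypothetical helper: number of output files when each split holds
// at most max_tensors tensors (ceiling division).
std::uint64_t count_splits_by_tensors(std::uint64_t n_tensors,
                                      std::uint64_t max_tensors = 128) {
    if (max_tensors == 0) {
        return 0; // invalid limit; the caller should reject it
    }
    return (n_tensors + max_tensors - 1) / max_tensors;
}
```

Under these assumptions, `parse_split_size("2G")` yields 2000000000 bytes, and a model with 300 tensors split at the default limit of 128 tensors would be written as 3 files.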