History

Vaibhav Srivastav 3775d0debb chore: add references to the quantisation space.		2024-05-14 23:01:11 +02:00
..
CMakeLists.txt	quantize: add imatrix and dataset metadata in GGUF (#6658 )	2024-04-26 20:06:33 +02:00
README.md	chore: add references to the quantisation space.	2024-05-14 23:01:11 +02:00
quantize.cpp	ggml : introduce bfloat16 support (#6412 )	2024-05-08 09:30:09 +03:00
tests.sh	tests : minor bash stuff (#6902 )	2024-04-25 14:27:20 +03:00

quantize

You can also use the GGUF-my-repo space on Hugging Face to build your own quants without any setup.

Note: It is synced to llama.cpp main every 6 hours.

Llama 2 7B