gemma.cpp/util
Luca Versari 4c23932289 Improve weight handling.
- Allow scaling of SFP weights
- Allow using uncompressed weights
- Do not try to compress weights in the main model calls
- Reduce code duplication in weight handling with some macros

Co-authored-by: Eugene Kliuchnikov <eustas@google.com>
Co-authored-by: Thomas Fischbacher <tfish@google.com>
Co-authored-by: Zoltan Szabadka <szabadka@google.com>
2024-04-06 11:08:47 +02:00
..
app.h Improve weight handling. 2024-04-06 11:08:47 +02:00
args.h Add standalone tool to compress weights. 2024-04-03 14:54:08 +00:00
convert_weights.py Add MQA support 2024-03-20 18:17:24 +08:00