gemma.cpp

History

Luca Versari 4c23932289 Improve weight handling. - Allow scaling of SFP weights - Allow using uncompressed weights - Do not try to compress weights in the main model calls - Reduce code duplication in weight handling with some macros Co-authored-by: Eugene Kliuchnikov <eustas@google.com> Co-authored-by: Thomas Fischbacher <tfish@google.com> Co-authored-by: Zoltan Szabadka <szabadka@google.com>		2024-04-06 11:08:47 +02:00
..
app.h	Improve weight handling.	2024-04-06 11:08:47 +02:00
args.h	Add standalone tool to compress weights.	2024-04-03 14:54:08 +00:00
convert_weights.py	Add MQA support	2024-03-20 18:17:24 +08:00