Jan Wassenberg
|
c29e9752c7
|
Refactor/cleanup, remove even_odd
* New compression/shared.h, remove sfp.h
* Remove unused DistortionStats b_l1_
* Move exact arithmetic functions into fp_arith
* Remove even_odd optimization for MatVec (mostly unused)
* use BF16 typedef more widely
* Add kMaxSFP constant
PiperOrigin-RevId: 670996386
|
2024-09-04 09:25:13 -07:00 |
Thomas Fischbacher
|
d9f86f8e4d
|
Add Python code for converting Griffin Orbax weights. Refs #301
PiperOrigin-RevId: 657296255
|
2024-07-29 12:53:30 -07:00 |
The gemma.cpp Authors
|
c1f243c351
|
Fix setting scales in Py binding
PiperOrigin-RevId: 655284183
|
2024-07-23 13:32:50 -07:00 |
Jan Wassenberg
|
f823371691
|
Cleanup: move util/compress and convert_weights to compression/
Also remove unused models/, lint convert_weights
PiperOrigin-RevId: 649613088
|
2024-07-05 04:16:52 -07:00 |
Jan Wassenberg
|
41efec4dba
|
Add Py bindings for weight compression
TODO: this uses clif instead of pybind11, and depends on absl.
PiperOrigin-RevId: 649575815
|
2024-07-05 01:06:00 -07:00 |