Jan Wassenberg
|
5c0da8c8c3
|
Minor cleanup/fixes:
- optimize_test simplify prompt check
- Fix SFP arg case
- Fix includes
- Align inputs in test
- IsInside: add DASSERT
- Fix PerClusterPool NumThreads
PiperOrigin-RevId: 672530385
|
2024-09-09 06:58:09 -07:00 |
Jan Wassenberg
|
c29e9752c7
|
Refactor/cleanup, remove even_odd
* New compression/shared.h, remove sfp.h
* Remove unused DistortionStats b_l1_
* Move exact arithmetic functions into fp_arith
* Remove even_odd optimization for MatVec (mostly unused)
* use BF16 typedef more widely
* Add kMaxSFP constant
PiperOrigin-RevId: 670996386
|
2024-09-04 09:25:13 -07:00 |
The gemma.cpp Authors
|
c1f243c351
|
Fix setting scales in Py binding
PiperOrigin-RevId: 655284183
|
2024-07-23 13:32:50 -07:00 |
Jan Wassenberg
|
f823371691
|
Cleanup: move util/compress and convert_weights to compression/
Also remove unused models/, lint convert_weights
PiperOrigin-RevId: 649613088
|
2024-07-05 04:16:52 -07:00 |
Jan Wassenberg
|
41efec4dba
|
Add Py bindings for weight compression
TODO: this uses clif instead of pybind11, and depends on absl.
PiperOrigin-RevId: 649575815
|
2024-07-05 01:06:00 -07:00 |