Author | Commit | Message | Date
--- | --- | --- | ---
Jan Wassenberg | a0e808e341 | Add compression/ comments, especially on SFP range. PiperOrigin-RevId: 642238720 | 2024-06-11 05:47:49 -07:00
Jan Wassenberg | 4f9155d8c6 | Add bf16 matmul support, update naming+test. Avoid int32, which can easily overflow for large matrices. Also fix IDE warning in sfp-inl. PiperOrigin-RevId: 640149845 | 2024-06-04 07:41:46 -07:00
Jan Wassenberg | a44cbdadc2 | Update to Highway 1.2 for topology/VQSelect. Also fix unused-warning in compress-inl. PiperOrigin-RevId: 639116915 | 2024-05-31 12:29:10 -07:00
Jan Wassenberg | c5c9fc300c | Enable even/odd for SFP. Refs #166. Disable it for float32 because there is not enough benefit. PiperOrigin-RevId: 631788326 | 2024-05-08 07:09:06 -07:00
Jan Wassenberg | b5a9ade75f | 2x speedup of SFP decode (1.4x overall) on AVX3_DL+. Thanks @nzmichaelh for suggesting table lookups! PiperOrigin-RevId: 631337524 | 2024-05-07 01:46:43 -07:00
Jan Wassenberg | a982ec1287 | Move code to gemma/ so we can remove error-prone copybara: comments. Also fix includes and Lint warnings. PiperOrigin-RevId: 623127487 | 2024-04-09 04:45:42 -07:00
Jan Wassenberg | 24add61dd9 | Fix SFP/NUQ for bf16 rounding in Highway. SFP: Avoid rounding twice, and more robust TestDot. NUQ: also more robust SNR, minor touchups to header. PiperOrigin-RevId: 618030096 | 2024-03-21 19:06:19 -07:00
Austin Huang | e29cd566cf | initial commit | 2024-02-21 03:31:22 +00:00