Jan Wassenberg
|
c5c9fc300c
|
Enable even/odd for SFP. Refs #166
Disable it for float32 because there is not enough benefit.
PiperOrigin-RevId: 631788326
|
2024-05-08 07:09:06 -07:00 |
Jan Wassenberg
|
b5a9ade75f
|
2x speedup of SFP decode (1.4x overall) on AVX3_DL+.
Thanks @nzmichaelh for suggesting table lookups!
PiperOrigin-RevId: 631337524
|
2024-05-07 01:46:43 -07:00 |
Jan Wassenberg
|
a982ec1287
|
Move code to gemma/ so we can remove error-prone copybara: comments.
Also fix includes and Lint warnings.
PiperOrigin-RevId: 623127487
|
2024-04-09 04:45:42 -07:00 |
Jan Wassenberg
|
24add61dd9
|
Fix SFP/NUQ for bf16 rounding in Highway
SFP: Avoid rounding twice, and more robust TestDot.
NUQ: also more robust SNR, minor touchups to header.
PiperOrigin-RevId: 618030096
|
2024-03-21 19:06:19 -07:00 |
Austin Huang
|
e29cd566cf
|
initial commit
|
2024-02-21 03:31:22 +00:00 |