Jan Wassenberg
|
9661b81c4b
|
Fix NUQ for SVE - incorrect nibble packing
Also speed up test
PiperOrigin-RevId: 670625545
|
2024-09-03 10:59:01 -07:00 |
Jan Wassenberg
|
aa11ddf5fc
|
1.22x NUQ compress speedup, fix out of bounds access, improve numerics
Also clarify the cost computation and move toward non-group-multiple num.
PiperOrigin-RevId: 670544245
|
2024-09-03 07:10:56 -07:00 |
Paul Chang
|
175e389c3c
|
revert back to HWY_ASSERT for lane constraints, qualify hn::Add
PiperOrigin-RevId: 640193239
|
2024-06-04 10:10:18 -07:00 |
Paul Chang
|
e8f59bb411
|
Fix underflow in NUQ ClusterCost()
PiperOrigin-RevId: 628137904
|
2024-04-25 11:28:51 -07:00 |
Jan Wassenberg
|
a982ec1287
|
Move code to gemma/ so we can remove error-prone copybara: comments.
Also fix includes and Lint warnings.
PiperOrigin-RevId: 623127487
|
2024-04-09 04:45:42 -07:00 |
Jan Wassenberg
|
24add61dd9
|
Fix SFP/NUQ for bf16 rounding in Highway
SFP: Avoid rounding twice, and more robust TestDot.
NUQ: also more robust SNR, minor touchups to header.
PiperOrigin-RevId: 618030096
|
2024-03-21 19:06:19 -07:00 |
Jan Wassenberg
|
bb9b023502
|
Support Bazel builds. Fixes #16
Also fix nuq/sfp-inl: warning, cast, and disable SCALAR
PiperOrigin-RevId: 612704056
|
2024-03-04 22:07:25 -08:00 |
enum-class
|
06dd013397
|
Add clang-tidy, fix narrowing issues, fix constness
|
2024-02-28 20:04:09 +08:00 |
Austin Huang
|
e29cd566cf
|
initial commit
|
2024-02-21 03:31:22 +00:00 |