..
bench_matmul.cc
MatMul simplification, threading strategy improvements
2025-09-03 21:45:07 -07:00
dot-inl.h
f32 LoopKC: 1.37x(M=512), 1.19(M=128) single-K F32,BF16 matmul speedup on SKX
2025-08-28 08:55:50 -07:00
dot_test.cc
1.29x speedup: bf16 C1/C2. Extend most ops to any type, expand test coverage.
2025-09-01 06:34:04 -07:00
fp_arith-inl.h
Decouple MatMul from gemma-inl: precompile for all input types
2025-05-27 07:08:58 -07:00
gemma_matvec_test.cc
De-singleton ThreadingContext so callers can pass in their own
2025-07-22 02:08:46 -07:00
matmul-inl.h
MatMul simplification, threading strategy improvements
2025-09-03 21:45:07 -07:00
matmul.cc
MatMul simplification, threading strategy improvements
2025-09-03 21:45:07 -07:00
matmul.h
MatMul simplification, threading strategy improvements
2025-09-03 21:45:07 -07:00
matmul_static-inl.h
Add MMOptions as an argument to Matmul.
2025-09-01 23:46:39 -07:00
matmul_static.h
Add MMOptions as an argument to Matmul.
2025-09-01 23:46:39 -07:00
matmul_static_bf16.cc
Speed up builds by skipping rarely used targets
2025-06-17 05:44:20 -07:00
matmul_static_f32.cc
Speed up builds by skipping rarely used targets
2025-06-17 05:44:20 -07:00
matmul_static_nuq.cc
Speed up builds by skipping rarely used targets
2025-06-17 05:44:20 -07:00
matmul_static_sfp.cc
Speed up builds by skipping rarely used targets
2025-06-17 05:44:20 -07:00
matmul_test.cc
Simplify MatMul: remove F32 special case (build time)
2025-09-02 04:29:21 -07:00
matvec-inl.h
Replace last ConstMat with MatPtr
2025-05-13 10:55:22 -07:00
ops-inl.h
MatMul simplification, threading strategy improvements
2025-09-03 21:45:07 -07:00
ops.h
De-singleton ThreadingContext so callers can pass in their own
2025-07-22 02:08:46 -07:00
ops_test.cc
1.29x speedup: bf16 C1/C2. Extend most ops to any type, expand test coverage.
2025-09-01 06:34:04 -07:00
sum-inl.h
Minor cleanup, Windows+Bazel build fixes
2024-10-10 09:05:06 -07:00