..
bench_matmul.cc
1.16x decode speedup: remove last MatVec in Attention
2025-06-02 09:40:29 -07:00
dot-inl.h
Huge refactor of weight handling and model loading.
2025-05-06 04:44:21 -07:00
dot_test.cc
Remove backprop/
2025-05-28 07:01:17 -07:00
fp_arith-inl.h
Decouple MatMul from gemma-inl: precompile for all input types
2025-05-27 07:08:58 -07:00
gemma_matvec_test.cc
Huge refactor of weight handling and model loading.
2025-05-06 04:44:21 -07:00
matmul-inl.h
1.07x batch decode speedup: more BF16 weights and activations
2025-06-03 23:30:18 -07:00
matmul.cc
Fix paligemma_test, refs #588
2025-06-03 04:45:22 -07:00
matmul.h
1.07x batch decode speedup: more BF16 weights and activations
2025-06-03 23:30:18 -07:00
matmul_static-inl.h
1.16x decode speedup: remove last MatVec in Attention
2025-06-02 09:40:29 -07:00
matmul_static.h
1.16x decode speedup: remove last MatVec in Attention
2025-06-02 09:40:29 -07:00
matmul_static_bf16.cc
Decouple MatMul from gemma-inl: precompile for all input types
2025-05-27 07:08:58 -07:00
matmul_static_f32.cc
Decouple MatMul from gemma-inl: precompile for all input types
2025-05-27 07:08:58 -07:00
matmul_static_nuq.cc
Decouple MatMul from gemma-inl: precompile for all input types
2025-05-27 07:08:58 -07:00
matmul_static_sfp.cc
Decouple MatMul from gemma-inl: precompile for all input types
2025-05-27 07:08:58 -07:00
matmul_test.cc
1.16x decode speedup: remove last MatVec in Attention
2025-06-02 09:40:29 -07:00
matvec-inl.h
Replace last ConstMat with MatPtr
2025-05-13 10:55:22 -07:00
ops-inl.h
1.07x batch decode speedup: more BF16 weights and activations
2025-06-03 23:30:18 -07:00
ops.h
Fix paligemma_test, refs #588
2025-06-03 04:45:22 -07:00
ops_test.cc
Fix paligemma_test, refs #588
2025-06-03 04:45:22 -07:00
sum-inl.h
Minor cleanup, Windows+Bazel build fixes
2024-10-10 09:05:06 -07:00