| bench_matmul.cc | 1.16x decode speedup: remove last MatVec in Attention | 2025-06-02 09:40:29 -07:00 |
| dot-inl.h | Huge refactor of weight handling and model loading. | 2025-05-06 04:44:21 -07:00 |
| dot_test.cc | Further cleanup: separate MatMulEnv arg | 2025-06-05 20:48:32 -07:00 |
| fp_arith-inl.h | Decouple MatMul from gemma-inl: precompile for all input types | 2025-05-27 07:08:58 -07:00 |
| gemma_matvec_test.cc | Huge refactor of weight handling and model loading. | 2025-05-06 04:44:21 -07:00 |
| matmul-inl.h | Further cleanup: separate MatMulEnv arg | 2025-06-05 20:48:32 -07:00 |
| matmul.cc | Further cleanup: separate MatMulEnv arg | 2025-06-05 20:48:32 -07:00 |
| matmul.h | Further cleanup: separate MatMulEnv arg | 2025-06-05 20:48:32 -07:00 |
| matmul_static-inl.h | 1.16x decode speedup: remove last MatVec in Attention | 2025-06-02 09:40:29 -07:00 |
| matmul_static.h | 1.16x decode speedup: remove last MatVec in Attention | 2025-06-02 09:40:29 -07:00 |
| matmul_static_bf16.cc | Decouple MatMul from gemma-inl: precompile for all input types | 2025-05-27 07:08:58 -07:00 |
| matmul_static_f32.cc | Decouple MatMul from gemma-inl: precompile for all input types | 2025-05-27 07:08:58 -07:00 |
| matmul_static_nuq.cc | Decouple MatMul from gemma-inl: precompile for all input types | 2025-05-27 07:08:58 -07:00 |
| matmul_static_sfp.cc | Decouple MatMul from gemma-inl: precompile for all input types | 2025-05-27 07:08:58 -07:00 |
| matmul_test.cc | 3x speedup of EmbedImagePatches - GEMM, not GEMV. | 2025-06-04 01:18:52 -07:00 |
| matvec-inl.h | Replace last ConstMat with MatPtr | 2025-05-13 10:55:22 -07:00 |
| ops-inl.h | Split gemma-inl into separate source files | 2025-06-05 05:36:44 -07:00 |
| ops.h | Fix paligemma_test, refs #588 | 2025-06-03 04:45:22 -07:00 |
| ops_test.cc | Split gemma-inl into separate source files | 2025-06-05 05:36:44 -07:00 |
| sum-inl.h | Minor cleanup, Windows+Bazel build fixes | 2024-10-10 09:05:06 -07:00 |