gemma.cpp/ops
Jan Wassenberg 3a266c662c Split gemma-inl into separate source files
weights, mat: zero-initialize padding, required since the MatMul "avoid B decompress" optimization.

PiperOrigin-RevId: 767562313
2025-06-05 05:36:44 -07:00
bench_matmul.cc 1.16x decode speedup: remove last MatVec in Attention 2025-06-02 09:40:29 -07:00
dot-inl.h Huge refactor of weight handling and model loading. 2025-05-06 04:44:21 -07:00
dot_test.cc Remove backprop/ 2025-05-28 07:01:17 -07:00
fp_arith-inl.h Decouple MatMul from gemma-inl: precompile for all input types 2025-05-27 07:08:58 -07:00
gemma_matvec_test.cc Huge refactor of weight handling and model loading. 2025-05-06 04:44:21 -07:00
matmul-inl.h Split gemma-inl into separate source files 2025-06-05 05:36:44 -07:00
matmul.cc Fix paligemma_test, refs #588 2025-06-03 04:45:22 -07:00
matmul.h 3x speedup of EmbedImagePatches - GEMM, not GEMV. 2025-06-04 01:18:52 -07:00
matmul_static-inl.h 1.16x decode speedup: remove last MatVec in Attention 2025-06-02 09:40:29 -07:00
matmul_static.h 1.16x decode speedup: remove last MatVec in Attention 2025-06-02 09:40:29 -07:00
matmul_static_bf16.cc Decouple MatMul from gemma-inl: precompile for all input types 2025-05-27 07:08:58 -07:00
matmul_static_f32.cc Decouple MatMul from gemma-inl: precompile for all input types 2025-05-27 07:08:58 -07:00
matmul_static_nuq.cc Decouple MatMul from gemma-inl: precompile for all input types 2025-05-27 07:08:58 -07:00
matmul_static_sfp.cc Decouple MatMul from gemma-inl: precompile for all input types 2025-05-27 07:08:58 -07:00
matmul_test.cc 3x speedup of EmbedImagePatches - GEMM, not GEMV. 2025-06-04 01:18:52 -07:00
matvec-inl.h Replace last ConstMat with MatPtr 2025-05-13 10:55:22 -07:00
ops-inl.h Split gemma-inl into separate source files 2025-06-05 05:36:44 -07:00
ops.h Fix paligemma_test, refs #588 2025-06-03 04:45:22 -07:00
ops_test.cc Split gemma-inl into separate source files 2025-06-05 05:36:44 -07:00
sum-inl.h Minor cleanup, Windows+Bazel build fixes 2024-10-10 09:05:06 -07:00