mirror of https://github.com/google/gemma.cpp.git
Only the weights; binding MatMul output worsens batch=1 prefill. Update gemma_batch_bench to use --decode_qbatch. Fix/remove prefill_activations in gemma-inl.h. Refactor: use BasePageBytes directly when binding Move BindB/C to .cc by de-templatizing Remove MatOwners::AllocateFor because it is weights-specific (binding or not) Disband MatOwners, replace with vector PiperOrigin-RevId: 759610477 |
||
|---|---|---|
| .. | ||
| bench_matmul.cc | ||
| dot-inl.h | ||
| dot_test.cc | ||
| fp_arith-inl.h | ||
| gemma_matvec_test.cc | ||
| matmul-inl.h | ||
| matmul.cc | ||
| matmul.h | ||
| matmul_test.cc | ||
| matvec-inl.h | ||
| ops-inl.h | ||
| ops.h | ||
| ops_test.cc | ||
| sum-inl.h | ||