mirror of https://github.com/google/gemma.cpp.git
~8x reduction (tested on few prompts) in Rope. ~3.8% prefill latency improvement. ~2.6% decode latency improvement. PiperOrigin-RevId: 664650108

| Name |
|---|
| gemma_matvec_test.cc |
| matmul-inl.h |
| matmul.h |
| matmul_test.cc |
| matvec-inl.h |
| ops-inl.h |
| ops_test.cc |