mirror of https://github.com/google/gemma.cpp.git
Split attention into functions, move into class. Fuse Rope and MulBy, allow non-in-place version to avoid copy from q to KV. Sink if() into MaybeLogitsSoftCap. PiperOrigin-RevId: 661168418 |
||
|---|---|---|
| .. | ||
| matmul-inl.h | ||
| matmul_test.cc | ||
| matvec-inl.h | ||
| matvec_test.cc | ||
| ops-inl.h | ||
| ops_test.cc | ||