mirror of https://github.com/google/gemma.cpp.git
Split attention into functions, move into class. Fuse Rope and MulBy, allow non-in-place version to avoid copy from q to KV. Sink if() into MaybeLogitsSoftCap. PiperOrigin-RevId: 661168418

| Name | | |
|---|---|---|
| evals | ||
| instantiations | ||
| activations.h | ||
| common.cc | ||
| common.h | ||
| configs.h | ||
| gemma-inl.h | ||
| gemma.cc | ||
| gemma.h | ||
| kv_cache.cc | ||
| kv_cache.h | ||
| run.cc | ||
| tokenizer.cc | ||
| tokenizer.h | ||
| weights.cc | ||
| weights.h | ||