mirror of https://github.com/google/gemma.cpp.git
- Allocate only the required KV caches and activation batch size - Add flags for batch sizes - Const-correct interface: Span of const int. - Also clean up the KVCache arg to a span. - Move kPrefillBatchSize into RuntimeConfig and remove related global constants. PiperOrigin-RevId: 655893197 |
||
|---|---|---|
| .. | ||
| benchmark.cc | ||
| benchmark_helper.cc | ||
| benchmark_helper.h | ||
| benchmarks.cc | ||
| cross_entropy.cc | ||
| cross_entropy.h | ||
| debug_prompt.cc | ||
| gemma_test.cc | ||
| run_mmlu.cc | ||