mirror of https://github.com/google/gemma.cpp.git
Examples
This directory contains simple examples that illustrate using gemma.cpp as
a library, beyond the interactive gemma app implemented in run.cc.
hello_world/ - minimal/template project for using gemma.cpp as a library. It sets up the model state and generates text for a single hard-coded prompt.