gemma.cpp/examples
Jan Wassenberg aaf51898b6 Major revamp #2 of Prefill: fix token order, parallel for multi-query
- Allocate only the required KV caches and activation batch size
- Add flags for batch sizes
- Const-correct interface: Span of const int.
- Also clean up the KVCache arg to a span.
- Move kPrefillBatchSize into RuntimeConfig and remove related global constants.

PiperOrigin-RevId: 655893197
2024-07-25 03:28:55 -07:00
..
hello_world Major revamp #2 of Prefill: fix token order, parallel for multi-query 2024-07-25 03:28:55 -07:00
README.md Update README with more details on contributing code, add experimental/ directory, add READMEs for subdirectories, clean up DEVELOPER notes 2024-03-15 14:10:24 -04:00

README.md

Examples

In this directory are some simple examples illustrating usage of gemma.cpp as a library beyond the interactive gemma app implemented in run.cc.

  • hello_world/ - minimal/template project for using gemma.cpp as a library. It sets up the model state and generates text for a single hard coded prompt.