gemma.cpp/examples
Jan Wassenberg 5e433e774a 1.1x prefill speedup, revamp threading in preparation for hierarchical parallelism.
Limit thread counts to detected. Add max_clusters arg.
Update detection logic to check for smt0 - previously we pinned to some siblings.

PiperOrigin-RevId: 659755311
2024-08-05 18:50:09 -07:00
hello_world 1.1x prefill speedup, revamp threading in preparation for hierarchical parallelism. 2024-08-05 18:50:09 -07:00
README.md Update README with more details on contributing code, add experimental/ directory, add READMEs for subdirectories, clean up DEVELOPER notes 2024-03-15 14:10:24 -04:00

README.md

Examples

This directory contains simple examples that illustrate how to use gemma.cpp as a library, beyond the interactive gemma app implemented in run.cc.

  • hello_world/ - minimal template project for using gemma.cpp as a library. It sets up the model state and generates text for a single hard-coded prompt.
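
The overall shape of such a program (set up model state once, then generate text for one fixed prompt while streaming tokens to a callback) can be sketched as follows. This is only an illustration under stated assumptions: `ToyModel`, `StreamFn`, `Generate`, and `RunHelloWorld` are hypothetical names invented here and are not the gemma.cpp API; the toy "model" simply echoes words from the prompt. See hello_world/ for the actual library calls.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <sstream>
#include <string>

// Hypothetical stand-in for a loaded model. NOT the gemma.cpp API.
struct ToyModel {
  // Called once per generated token; return false to stop generation early.
  using StreamFn = std::function<bool(const std::string& token)>;

  // Toy "generation": emits the prompt's whitespace-separated words,
  // one at a time, until max_tokens is reached or the callback stops it.
  void Generate(const std::string& prompt, std::size_t max_tokens,
                const StreamFn& stream) const {
    assert(stream && "a streaming callback is required");
    std::istringstream words(prompt);
    std::string word;
    std::size_t emitted = 0;
    while (emitted < max_tokens && words >> word) {
      if (!stream(word)) break;
      ++emitted;
    }
  }
};

// Runs a single hard-coded prompt and returns the collected output.
std::string RunHelloWorld() {
  // Step 1: set up model state. A real program would load weights and a
  // tokenizer here; the toy model needs no setup.
  ToyModel model;

  // Step 2: the single hard-coded prompt.
  const std::string prompt = "Write a greeting to the world.";

  // Step 3: generate, collecting streamed tokens into one string.
  std::string output;
  model.Generate(prompt, /*max_tokens=*/3,
                 [&output](const std::string& token) {
                   if (!output.empty()) output += ' ';
                   output += token;
                   return true;  // keep generating
                 });
  return output;
}
```

Calling `RunHelloWorld()` from a `main` function and printing the result is enough to turn this into a standalone program. The callback-based design lets the caller print tokens as they arrive or stop generation early by returning `false`.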