mirror of https://github.com/google/gemma.cpp.git
Initial Home page
commit 2d50636b74
@@ -0,0 +1,18 @@
Incremental Library Improvements

- MQA (multi-query attention) implementation for 2B models
- Refactor terminal UI code out of `run.cc` into a reusable header
- Toy example of an alternative model in `examples/`

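As a rough illustration of the MQA item above, the sketch below computes attention for a single token where every query head shares one key/value head. It is a minimal sketch under stated assumptions, not gemma.cpp's implementation: the function name `MultiQueryAttention`, the flat `std::vector<float>` layouts, and the parameter names are all hypothetical.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper for illustration only; not part of gemma.cpp.
// q:    [num_heads * head_dim], one query vector per head.
// k, v: [seq_len * head_dim], a single key/value head shared by all
//       query heads (this sharing is what makes it multi-query attention).
// Returns [num_heads * head_dim].
std::vector<float> MultiQueryAttention(const std::vector<float>& q,
                                       const std::vector<float>& k,
                                       const std::vector<float>& v,
                                       size_t num_heads, size_t head_dim,
                                       size_t seq_len) {
  std::vector<float> out(num_heads * head_dim, 0.0f);
  const float scale = 1.0f / std::sqrt(static_cast<float>(head_dim));
  std::vector<float> scores(seq_len);
  for (size_t h = 0; h < num_heads; ++h) {
    const float* qh = q.data() + h * head_dim;
    // Scaled dot product of this head's query with every shared key.
    float max_score = 0.0f;
    for (size_t t = 0; t < seq_len; ++t) {
      float dot = 0.0f;
      for (size_t d = 0; d < head_dim; ++d) dot += qh[d] * k[t * head_dim + d];
      scores[t] = dot * scale;
      if (t == 0 || scores[t] > max_score) max_score = scores[t];
    }
    // Softmax, shifted by the maximum for numerical stability.
    float sum = 0.0f;
    for (size_t t = 0; t < seq_len; ++t) {
      scores[t] = std::exp(scores[t] - max_score);
      sum += scores[t];
    }
    // Weighted sum of the shared values.
    for (size_t t = 0; t < seq_len; ++t) {
      const float w = scores[t] / sum;
      for (size_t d = 0; d < head_dim; ++d) {
        out[h * head_dim + d] += w * v[t * head_dim + d];
      }
    }
  }
  return out;
}
```

Because `k` and `v` hold one head rather than `num_heads` heads, the cached keys and values shrink by a factor of `num_heads` compared with standard multi-head attention.
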
Refactoring

- Pub/Sub message queue as the interface between application and runtime for I/O, introspection, and error handling
- Allow model changes at runtime

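One possible shape for the Pub/Sub item above is a small topic-based bus that the runtime publishes to and the application subscribes to. This is only a sketch: `Message`, `MessageBus`, and the topic strings are hypothetical names that do not exist in gemma.cpp, and the class is single-threaded (a real interface between application and runtime would likely need locking).

```cpp
#include <cstddef>
#include <functional>
#include <queue>
#include <string>
#include <unordered_map>
#include <utility>
#include <vector>

// Hypothetical types for illustration only; not part of gemma.cpp.
struct Message {
  std::string topic;    // e.g. "token", "error", "stats" (assumed names)
  std::string payload;
};

// Single-threaded sketch. Handlers must not call Subscribe() while
// Dispatch() is running; publishing from inside a handler is fine.
class MessageBus {
 public:
  using Handler = std::function<void(const Message&)>;

  // Registers a handler for one topic.
  void Subscribe(const std::string& topic, Handler handler) {
    handlers_[topic].push_back(std::move(handler));
  }

  // Enqueues a message; delivery is deferred until Dispatch().
  void Publish(Message msg) { pending_.push(std::move(msg)); }

  // Delivers queued messages in FIFO order and returns the number of
  // handler invocations. Messages with no subscriber are dropped.
  size_t Dispatch() {
    size_t delivered = 0;
    while (!pending_.empty()) {
      Message msg = std::move(pending_.front());
      pending_.pop();
      auto it = handlers_.find(msg.topic);
      if (it == handlers_.end()) continue;
      for (const Handler& handler : it->second) {
        handler(msg);
        ++delivered;
      }
    }
    return delivered;
  }

 private:
  std::unordered_map<std::string, std::vector<Handler>> handlers_;
  std::queue<Message> pending_;
};
```

With this shape, a terminal UI could subscribe to a "token" topic for output and an "error" topic for failures, so the runtime never writes to the console directly.
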
Google Workflow Items

- Incorporate copybara:end into copybara transformations

Features

- %save / %load KV cache
