llama.cpp/examples/duo/README.md

7 lines
172 B
Markdown

## duo
Minimal example. What's not implemented, but can be implemented separately in pieces:
* tree-based speculation
* correct sampling
* support more than 2 instances
*