llama.cpp/examples/duo
Oleksandr Kuvshynov 10d5aefed5 logging 2024-05-24 22:21:41 -04:00
..
CMakeLists.txt duo v0 2024-05-21 16:11:30 -04:00
README.md fixes 2024-05-24 12:22:59 -04:00
duo.cpp logging 2024-05-24 22:21:41 -04:00

README.md

duo

Minimal example. What's not implemented, but can be implemented separately in pieces:

  • tree-based speculation
  • correct sampling
  • support more than 2 instances
  • just one instance speculates