llama.cpp/examples/duo
Oleksandr Kuvshynov 60fe62e6eb some renaming 2024-05-22 23:52:36 -04:00
..
CMakeLists.txt duo v0 2024-05-21 16:11:30 -04:00
README.md duo: first ~working option 2024-05-22 23:02:31 -04:00
duo.cpp some renaming 2024-05-22 23:52:36 -04:00

README.md

duo

Minimal example. What's not implemented, but can be implemented separately in pieces:

  • tree-based speculation
  • correct sampling
  • support more than 2 instances