llama.cpp/examples/duo
Oleksandr Kuvshynov 02e2c91d01 correct split id 2024-05-24 09:52:28 -04:00
CMakeLists.txt duo v0 2024-05-21 16:11:30 -04:00
README.md duo: first ~working option 2024-05-22 23:02:31 -04:00
duo.cpp correct split id 2024-05-24 09:52:28 -04:00

README.md

duo

Minimal example. The following are not implemented, but each can be added separately:

  • tree-based speculation
  • correct sampling
  • support for more than 2 instances
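
For reference, here is a minimal sketch of what "correct sampling" could look like, assuming it refers to the standard speculative sampling acceptance rule: keep a drafted token with probability min(1, p/q), otherwise resample from the normalized residual max(0, p - q). The helper `verify_drafted_token` and all of its parameters are hypothetical and are not taken from `duo.cpp`; the caller is assumed to supply full probability vectors from both models and its own uniform random numbers.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper, not part of duo.cpp.
// p          target-model probabilities over the vocabulary at this position
// q          draft-model probabilities over the same vocabulary
// drafted    token id proposed by the draft model (assumed sampled from q)
// u_accept   uniform random number in [0, 1) used for the accept test
// u_resample uniform random number in [0, 1) used if the token is rejected
// Returns the token to emit; `accepted` reports whether the draft was kept.
int verify_drafted_token(const std::vector<double> & p,
                         const std::vector<double> & q,
                         int drafted,
                         double u_accept,
                         double u_resample,
                         bool & accepted) {
    assert(p.size() == q.size());
    assert(drafted >= 0 && static_cast<std::size_t>(drafted) < p.size());
    assert(q[drafted] > 0.0);

    // Accept the drafted token with probability min(1, p/q).
    const double ratio = std::min(1.0, p[drafted] / q[drafted]);
    if (u_accept < ratio) {
        accepted = true;
        return drafted;
    }
    accepted = false;

    // Rejected: sample from the residual distribution max(0, p - q),
    // normalized by its total mass.
    double total = 0.0;
    for (std::size_t i = 0; i < p.size(); ++i) {
        total += std::max(0.0, p[i] - q[i]);
    }

    const double threshold = u_resample * total;
    double cumulative = 0.0;
    int last = drafted;
    for (std::size_t i = 0; i < p.size(); ++i) {
        const double residual = std::max(0.0, p[i] - q[i]);
        if (residual <= 0.0) {
            continue;
        }
        cumulative += residual;
        last = static_cast<int>(i);
        if (threshold < cumulative) {
            return last;
        }
    }
    return last; // guard against floating-point rounding at the upper end
}
```

With this rule the emitted tokens follow the target model's distribution rather than the draft model's, which is the property a greedy "accept if equal" comparison does not provide once sampling is enabled.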