Commit Graph

22 Commits

Author           SHA1        Message                               Date
Georgi Gerganov  2ffa45edfc  add tokens                            2026-02-16 21:52:54 +02:00
Georgi Gerganov  9c29be1177  store full response                   2026-02-16 21:44:29 +02:00
Georgi Gerganov  013963cfd5  add html                              2026-02-16 21:22:06 +02:00
Georgi Gerganov  e2e998a2d6  fix prompts                           2026-02-16 21:02:25 +02:00
Georgi Gerganov  6c41664b8b  simplify                              2026-02-16 19:50:27 +02:00
Georgi Gerganov  7b84af8051  fix counts                            2026-02-16 16:38:31 +02:00
Georgi Gerganov  60a501e138  cleanup                               2026-02-16 16:31:14 +02:00
Georgi Gerganov  e6e777cfb3  resume eval                           2026-02-16 16:21:36 +02:00
Georgi Gerganov  ad3a54eb68  ignore errors                         2026-02-16 15:23:23 +02:00
Georgi Gerganov  de956a6ca8  cleanup                               2026-02-16 12:02:16 +02:00
Georgi Gerganov  350e7c1409  datasets : fix aime2025               2026-02-16 11:55:57 +02:00
Georgi Gerganov  db10dda1f3  grade : improve regex + logs          2026-02-16 11:51:36 +02:00
Georgi Gerganov  52759bf078  grader : update prompt                2026-02-16 11:17:53 +02:00
Georgi Gerganov  99e3c3d02c  datasets : add aime2025               2026-02-16 11:07:54 +02:00
Georgi Gerganov  c6315655b7  cont                                  2026-02-16 10:56:58 +02:00
Georgi Gerganov  f762a71d56  grader : improve example answers      2026-02-16 10:51:41 +02:00
Georgi Gerganov  73e61d5b75  rename                                2026-02-16 10:30:10 +02:00
Georgi Gerganov  1db8428f00  remove old files                      2026-02-15 22:16:54 +02:00
gatbontonpc      8839037528  add checkpointing                     2026-02-15 21:08:22 +02:00
gatbontonpc      89cab3dbc5  Add readme                            2026-02-15 21:08:22 +02:00
gatbontonpc      c2d83ca048  multi source llama-eval               2026-02-15 21:08:22 +02:00
gatbontonpc      c05df17ce3  working llama-eval mc and math suite  2026-02-15 21:08:19 +02:00