gemma.cpp/evals
Jan Wassenberg 2c28b18eb0 Add NestedPools: one per socket/cluster
Use in dot_test
app.h: add new flags and rename num_threads to max_threads
matmul: Parallelize MatMulSlow and enable spinning, more large/fewer medium test cases
PiperOrigin-RevId: 683216386
2024-10-07 09:40:19 -07:00
benchmark.cc Add an additional QueryModel() overload to GemmaEnv. 2024-09-03 02:25:19 -07:00
benchmark_helper.cc Add NestedPools: one per socket/cluster 2024-10-07 09:40:19 -07:00
benchmark_helper.h Add an additional QueryModel() overload to GemmaEnv. 2024-09-03 02:25:19 -07:00
benchmarks.cc Move benchmark_helper to evals/, weights_raw to compression/. 2024-07-08 01:13:23 -07:00
cross_entropy.cc 1.09x decode speedup for topk=1/temp0: fuse softmax and sample 2024-09-30 08:37:41 -07:00
cross_entropy.h Declutter gemma/ directory, move binaries to evals/ and util/. 2024-07-01 09:51:04 -07:00
debug_prompt.cc Extend LayersOutputFunc to take query index and auxiliary int 2024-07-30 06:53:56 -07:00
gemma_test.cc Update expected entropy values for GRIFFIN_2B model. 2024-09-24 02:12:59 -07:00
run_mmlu.cc Add an additional QueryModel() overload to GemmaEnv. 2024-09-03 02:25:19 -07:00