gemma.cpp/evals
Jan Wassenberg 2d14d796e3 1.09x decode speedup for topk=1/temp0: fuse softmax and sample
PiperOrigin-RevId: 680589099
2024-09-30 08:37:41 -07:00
benchmark.cc Add an additional QueryModel() overload to GemmaEnv. 2024-09-03 02:25:19 -07:00
benchmark_helper.cc Fix topology display for platforms where it fails (Apple) 2024-09-23 08:14:54 -07:00
benchmark_helper.h Add an additional QueryModel() overload to GemmaEnv. 2024-09-03 02:25:19 -07:00
benchmarks.cc Move benchmark_helper to evals/, weights_raw to compression/. 2024-07-08 01:13:23 -07:00
cross_entropy.cc 1.09x decode speedup for topk=1/temp0: fuse softmax and sample 2024-09-30 08:37:41 -07:00
cross_entropy.h Declutter gemma/ directory, move binaries to evals/ and util/. 2024-07-01 09:51:04 -07:00
debug_prompt.cc Extend LayersOutputFunc to take query index and auxiliary int 2024-07-30 06:53:56 -07:00
gemma_test.cc Update expected entropy values for GRIFFIN_2B model. 2024-09-24 02:12:59 -07:00
run_mmlu.cc Add an additional QueryModel() overload to GemmaEnv. 2024-09-03 02:25:19 -07:00