gemma.cpp/evals
Jan Wassenberg b603425bf3 Fix batch inference: dangling reference
Also add more detailed asserts/error messages.

PiperOrigin-RevId: 807695421
2025-09-16 08:01:56 -07:00
benchmark.cc          Rename GetModelConfig->Config                                              2025-07-29 10:18:12 -07:00
benchmark_helper.cc   Fix batch inference: dangling reference                                    2025-09-16 08:01:56 -07:00
benchmark_helper.h    Fix batch inference: dangling reference                                    2025-09-16 08:01:56 -07:00
benchmarks.cc         Introduce QueryResult in GemmaEnv and add a shortcut for WrapAndTokenize.  2024-10-14 04:45:21 -07:00
cross_entropy.cc      1.15x speedup: parallel sampling, enabled by new RNG                       2025-09-05 07:24:02 -07:00
cross_entropy.h       Move MatMulEnv out of Gemma to enable concurrent calls                     2025-06-23 01:20:09 -07:00
debug_prompt.cc       Move fields, io* and blob* from compression/ into io/                      2025-05-06 11:17:19 -07:00
gemma_batch_bench.cc  1.03x speedup: fused FFN                                                   2025-09-15 10:26:37 -07:00
gemma_test.cc         Remove Griffin support                                                     2025-09-05 02:35:40 -07:00
prompts.h             Benchmark gemma.cpp with different length inputs.                          2024-10-10 15:59:26 -07:00
run_mmlu.cc           Replace mt19937 with new generator to enable parallel sampling             2025-09-04 23:49:10 -07:00