gemma.cpp/evals
Jan Wassenberg 1b72c22345 Refactor Gemma ctor and improve pool NUMA support
Gemma receives a MatMulEnv arg, with comment on lifetime
Split threading into topology so the latter can be used in allocator
Add AllocClasses() for non-POD (ThreadPool)
Support binding pool to NUMA node
Update threading_test with latency measurements
Also update Highway version.

PiperOrigin-RevId: 736904748
2025-03-14 10:19:00 -07:00
benchmark.cc          Rename ModelTraining to PromptWrapping which is a more accurate name.      2024-12-13 07:45:59 -08:00
benchmark_helper.cc   Refactor Gemma ctor and improve pool NUMA support                          2025-03-14 10:19:00 -07:00
benchmark_helper.h    Refactor Gemma ctor and improve pool NUMA support                          2025-03-14 10:19:00 -07:00
benchmarks.cc         Introduce QueryResult in GemmaEnv and add a shortcut for WrapAndTokenize.  2024-10-14 04:45:21 -07:00
cross_entropy.cc      Tiny cleanup.                                                              2024-12-10 03:34:05 -08:00
cross_entropy.h       Introduce QueryResult in GemmaEnv and add a shortcut for WrapAndTokenize.  2024-10-14 04:45:21 -07:00
debug_prompt.cc       Introduce QueryResult in GemmaEnv and add a shortcut for WrapAndTokenize.  2024-10-14 04:45:21 -07:00
gemma_batch_bench.cc  Add a simple benchmark for batching.                                       2024-11-21 10:59:49 -08:00
gemma_test.cc         Allow interactive use with new single-file weight format.                  2025-01-15 07:22:33 -08:00
prompts.h             Benchmark gemma.cpp with different length inputs.                          2024-10-10 15:59:26 -07:00
run_mmlu.cc           Make top_k a runtime argument (instead of a model argument).               2024-11-13 09:48:59 -08:00