gemma.cpp/evals
Jan Wassenberg 02ce1e344f Use NestedPools, add NUMA infra
Improved threading.h, fix thread counts for single package/cluster systems
Temporarily forces to a single socket. Prefill 29.28 tps, decode 6.92.

Also fix benchmarks.cc build, update tensor allocator to Allocator

PiperOrigin-RevId: 687307167
2024-10-18 08:11:18 -07:00
..
benchmark.cc Eliminated TConfig. 2024-10-17 05:04:22 -07:00
benchmark_helper.cc Use NestedPools, add NUMA infra 2024-10-18 08:11:18 -07:00
benchmark_helper.h Use NestedPools, add NUMA infra 2024-10-18 08:11:18 -07:00
benchmarks.cc Introduce QueryResult in GemmaEnv and add a shortcut for WrapAndTokenize. 2024-10-14 04:45:21 -07:00
cross_entropy.cc Eliminated TConfig. 2024-10-17 05:04:22 -07:00
cross_entropy.h Introduce QueryResult in GemmaEnv and add a shortcut for WrapAndTokenize. 2024-10-14 04:45:21 -07:00
debug_prompt.cc Introduce QueryResult in GemmaEnv and add a shortcut for WrapAndTokenize. 2024-10-14 04:45:21 -07:00
gemma_test.cc Introduce QueryResult in GemmaEnv and add a shortcut for WrapAndTokenize. 2024-10-14 04:45:21 -07:00
prompts.h Benchmark gemma.cpp with different length inputs. 2024-10-10 15:59:26 -07:00
run_mmlu.cc Introduce QueryResult in GemmaEnv and add a shortcut for WrapAndTokenize. 2024-10-14 04:45:21 -07:00