gemma.cpp

Jan Wassenberg 1b72c22345 Refactor Gemma ctor and improve pool NUMA support	2025-03-14 10:19:00 -07:00
    Gemma receives a MatMulEnv arg, with comment on lifetime
    Split threading into topology so the latter can be used in allocator
    Add AllocClasses() for non-POD (ThreadPool)
    Support binding pool to NUMA node
    Update threading_test with latency measurements
    Also update Highway version.
    PiperOrigin-RevId: 736904748
benchmark.cc	Rename ModelTraining to PromptWrapping which is a more accurate name.	2024-12-13 07:45:59 -08:00
benchmark_helper.cc	Refactor Gemma ctor and improve pool NUMA support	2025-03-14 10:19:00 -07:00
benchmark_helper.h	Refactor Gemma ctor and improve pool NUMA support	2025-03-14 10:19:00 -07:00
benchmarks.cc	Introduce QueryResult in GemmaEnv and add a shortcut for WrapAndTokenize.	2024-10-14 04:45:21 -07:00
cross_entropy.cc	Tiny cleanup.	2024-12-10 03:34:05 -08:00
cross_entropy.h	Introduce QueryResult in GemmaEnv and add a shortcut for WrapAndTokenize.	2024-10-14 04:45:21 -07:00
debug_prompt.cc	Introduce QueryResult in GemmaEnv and add a shortcut for WrapAndTokenize.	2024-10-14 04:45:21 -07:00
gemma_batch_bench.cc	Add a simple benchmark for batching.	2024-11-21 10:59:49 -08:00
gemma_test.cc	Allow interactive use with new single-file weight format.	2025-01-15 07:22:33 -08:00
prompts.h	Benchmark gemma.cpp with different length inputs.	2024-10-10 15:59:26 -07:00
run_mmlu.cc	Make top_k a runtime argument (instead of a model argument).	2024-11-13 09:48:59 -08:00