gemma.cpp/backprop
Jan Wassenberg 02ce1e344f Use NestedPools, add NUMA infra
Improved threading.h, fix thread counts for single package/cluster systems
Temporarily forces to a single socket. Prefill 29.28 tps, decode 6.92.

Also fix benchmarks.cc build, update tensor allocator to Allocator

PiperOrigin-RevId: 687307167
2024-10-18 08:11:18 -07:00
activations.h Eliminated TConfig. 2024-10-17 05:04:22 -07:00
backward-inl.h Eliminated TConfig. 2024-10-17 05:04:22 -07:00
backward.cc Eliminated TConfig. 2024-10-17 05:04:22 -07:00
backward.h Eliminated TConfig. 2024-10-17 05:04:22 -07:00
backward_scalar.h Eliminated TConfig. 2024-10-17 05:04:22 -07:00
backward_scalar_test.cc Eliminated TConfig. 2024-10-17 05:04:22 -07:00
backward_test.cc Eliminated TConfig. 2024-10-17 05:04:22 -07:00
common_scalar.h Added MatPtr/MatPtrT/MatStorageT/MatStorage as a dynamically-sized replacement for CompressedArray. 2024-10-10 08:22:30 -07:00
forward-inl.h Eliminated TConfig. 2024-10-17 05:04:22 -07:00
forward.cc Eliminated TConfig. 2024-10-17 05:04:22 -07:00
forward.h Eliminated TConfig. 2024-10-17 05:04:22 -07:00
forward_scalar.h Eliminated TConfig. 2024-10-17 05:04:22 -07:00
optimize_test.cc Use NestedPools, add NUMA infra 2024-10-18 08:11:18 -07:00
optimizer.cc Eliminated TConfig. 2024-10-17 05:04:22 -07:00
optimizer.h Eliminated TConfig. 2024-10-17 05:04:22 -07:00
prompt.h Add missing include 2024-06-04 10:29:12 +00:00
sampler.h Add config for att/final cap, skip max-subtract. Fixes #278 2024-07-01 09:45:26 -07:00
test_util.h Eliminated TConfig. 2024-10-17 05:04:22 -07:00