gemma.cpp/ops
Jan Wassenberg 02ce1e344f Use NestedPools, add NUMA infra
Improved threading.h, fix thread counts for single package/cluster systems
Temporarily forces to a single socket. Prefill 29.28 tps, decode 6.92.

Also fix benchmarks.cc build, update tensor allocator to Allocator

PiperOrigin-RevId: 687307167
2024-10-18 08:11:18 -07:00
..
dot-inl.h Use NestedPools, add NUMA infra 2024-10-18 08:11:18 -07:00
dot_test.cc Update expected ranges in dot_test. 2024-10-13 23:47:20 -07:00
fp_arith-inl.h Cascaded summation for Softmax 2024-09-20 10:31:23 -07:00
gemma_matvec_test.cc Eliminated TConfig. 2024-10-17 05:04:22 -07:00
matmul-inl.h Use NestedPools, add NUMA infra 2024-10-18 08:11:18 -07:00
matmul.h Use NestedPools, add NUMA infra 2024-10-18 08:11:18 -07:00
matmul_test.cc Use NestedPools, add NUMA infra 2024-10-18 08:11:18 -07:00
matvec-inl.h Use NestedPools, add NUMA infra 2024-10-18 08:11:18 -07:00
ops-inl.h Eliminated TConfig. 2024-10-17 05:04:22 -07:00
ops_test.cc Eliminated TConfig. 2024-10-17 05:04:22 -07:00
sum-inl.h Minor cleanup, Windows+Bazel build fixes 2024-10-10 09:05:06 -07:00