mirror of https://github.com/google/gemma.cpp.git
Improved threading.h, fix thread counts for single package/cluster systems. Temporarily forces to a single socket. Prefill 29.28 tps, decode 6.92. Also fix benchmarks.cc build, update tensor allocator to Allocator.

PiperOrigin-RevId: 687307167

| Name |
|---|
| dot-inl.h |
| dot_test.cc |
| fp_arith-inl.h |
| gemma_matvec_test.cc |
| matmul-inl.h |
| matmul.h |
| matmul_test.cc |
| matvec-inl.h |
| ops-inl.h |
| ops_test.cc |
| sum-inl.h |