gemma.cpp/util
Jan Wassenberg b7b3d353db Simplify MatMul: remove F32 special case (build time)
Also move kMaxM into separate kMaxBatchSize

PiperOrigin-RevId: 802086590
2025-09-02 04:29:21 -07:00
allocator.cc          Cleanup: remove unused kCyclic, remove 2 suffix  2025-05-13 01:06:41 -07:00
allocator.h           1.31x batch prefill, 1.24x batch decode speedup: NUMA binding  2025-05-16 07:42:13 -07:00
args.h                Move fields, io* and blob* from compression/ into io/  2025-05-06 11:17:19 -07:00
basics.h              Simplify MatMul: remove F32 special case (build time)  2025-09-02 04:29:21 -07:00
mat.cc                De-singleton ThreadingContext so callers can pass in their own  2025-07-22 02:08:46 -07:00
mat.h                 De-singleton ThreadingContext so callers can pass in their own  2025-07-22 02:08:46 -07:00
test_util.h           Minor cleanup/fixes:  2024-09-09 06:58:09 -07:00
threading.cc          (Resubmit) Prepare profiler annotations for new API  2025-08-13 01:38:24 -07:00
threading.h           Add ParallelFor wrapper function and one new mode  2025-09-02 01:40:09 -07:00
threading_context.cc  (Resubmit) Prepare profiler annotations for new API  2025-08-13 01:38:24 -07:00
threading_context.h   Expand ThreadingContext comments  2025-08-28 08:32:10 -07:00
threading_test.cc     De-singleton ThreadingContext so callers can pass in their own  2025-07-22 02:08:46 -07:00
topology.cc           Avoid affinity related warnings on Apple. Refs #625  2025-07-03 08:22:31 -07:00
topology.h            Fix thread name when skipping packages/clusters  2025-06-01 23:50:11 -07:00