gemma.cpp/util
Jan Wassenberg 7263ab8445 MatMul simplification, threading strategy improvements
remove MatMul f32 special case (smaller code)
types: Add u32/u64 for use by Activations
move renamed ParallelismStrategy to threading_context so callers can pass ctx
ensure worker index is unique across clusters
matmul.h: const member functions for renamed policy classes (easier to call)
PiperOrigin-RevId: 802848086
2025-09-03 21:45:07 -07:00
File                  Last updated                Last commit
allocator.cc          2025-05-13 01:06:41 -07:00  Cleanup: remove unused kCyclic, remove 2 suffix
allocator.h           2025-05-16 07:42:13 -07:00  1.31x batch prefill, 1.24x batch decode speedup: NUMA binding
args.h                2025-05-06 11:17:19 -07:00  Move fields, io* and blob* from compression/ into io/
basics.h              2025-09-03 21:45:07 -07:00  MatMul simplification, threading strategy improvements
mat.cc                2025-07-22 02:08:46 -07:00  De-singleton ThreadingContext so callers can pass in their own
mat.h                 2025-07-22 02:08:46 -07:00  De-singleton ThreadingContext so callers can pass in their own
test_util.h           2024-09-09 06:58:09 -07:00  Minor cleanup/fixes:
threading.cc          2025-08-13 01:38:24 -07:00  (Resubmit) Prepare profiler annotations for new API
threading.h           2025-09-03 21:45:07 -07:00  MatMul simplification, threading strategy improvements
threading_context.cc  2025-08-13 01:38:24 -07:00  (Resubmit) Prepare profiler annotations for new API
threading_context.h   2025-09-03 21:45:07 -07:00  MatMul simplification, threading strategy improvements
threading_test.cc     2025-07-22 02:08:46 -07:00  De-singleton ThreadingContext so callers can pass in their own
topology.cc           2025-07-03 08:22:31 -07:00  Avoid affinity related warnings on Apple. Refs #625
topology.h            2025-06-01 23:50:11 -07:00  Fix thread name when skipping packages/clusters