gemma.cpp/util
Biruk Mammo 5a05857deb [Gemma.cpp] Allows non-owned arguments for attention methods.
* Adds and uses a new `AttentionActivationPtrs` that holds non-owning `MatPtr`s. Acts as a view into `AttentionActivations`.
* Updates `QBatch` to hold non-owning `MatPtr`s to the KV caches.
* Enables the `MatPtrT` default constructor for simpler initializations.
* Pulls out and passes `LayerWeightsPtrs::query_norm_scale` directly. While `LayerWeightsPtrs` already held non-owning `MatPtr`s, this change avoids the need to find and construct several empty weight tensors just to construct one `query_norm_scale` tensor.
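
The owning-struct / non-owning-view split described in the bullets above can be illustrated with a minimal sketch. All names here (`MatView`, `OwningActivations`, `ActivationPtrs`, `CopyQToOutput`) are hypothetical stand-ins chosen for illustration; they are not the actual `MatPtrT`, `AttentionActivations`, or `AttentionActivationPtrs` definitions, whose members and signatures are not shown in this listing.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical non-owning matrix pointer (NOT gemma.cpp's MatPtrT).
// Default-constructible, so an "empty" view needs no backing tensor.
template <typename T>
struct MatView {
  T* data = nullptr;  // not owned; never freed by the view
  size_t rows = 0;
  size_t cols = 0;

  MatView() = default;
  MatView(T* d, size_t r, size_t c) : data(d), rows(r), cols(c) {}

  T& At(size_t r, size_t c) const {
    assert(data != nullptr && r < rows && c < cols);
    return data[r * cols + c];
  }
};

// Hypothetical owning storage, playing the role of AttentionActivations.
struct OwningActivations {
  OwningActivations(size_t b, size_t d)
      : batch(b), dim(d), q(b * d), att_out(b * d) {}
  size_t batch;
  size_t dim;
  std::vector<float> q;        // owns the memory
  std::vector<float> att_out;  // owns the memory
};

// Hypothetical view, playing the role of AttentionActivationPtrs.
// Copying it is cheap and it must not outlive the storage it points into.
struct ActivationPtrs {
  ActivationPtrs() = default;
  explicit ActivationPtrs(OwningActivations& a)
      : q(a.q.data(), a.batch, a.dim),
        att_out(a.att_out.data(), a.batch, a.dim) {}
  MatView<float> q;
  MatView<float> att_out;
};

// A function written against the view accepts any memory with this layout,
// whether or not the caller owns it. Writes go through to the owner.
inline void CopyQToOutput(const ActivationPtrs& acts) {
  for (size_t r = 0; r < acts.q.rows; ++r) {
    for (size_t c = 0; c < acts.q.cols; ++c) {
      acts.att_out.At(r, c) = acts.q.At(r, c);
    }
  }
}
```

In this sketch a caller would construct `OwningActivations` once, wrap it in `ActivationPtrs`, and pass the view to `CopyQToOutput`; the default-constructed `ActivationPtrs` mirrors the motivation given for enabling a default constructor on the pointer type.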

PiperOrigin-RevId: 824584177
2025-10-27 10:43:25 -07:00
allocator.cc Warning fix (unused var), Windows build fix (missing member variable) 2025-10-21 10:17:34 -07:00
allocator.h Warning fix (unused var), Windows build fix (missing member variable) 2025-10-21 10:17:34 -07:00
args.h Move fields, io* and blob* from compression/ into io/ 2025-05-06 11:17:19 -07:00
basics.cc Replace mt19937 with new generator to enable parallel sampling 2025-09-04 23:49:10 -07:00
basics.h Add 8-bit integer quantization (I8Stream) to Gemma.cpp. 2025-10-15 09:25:20 -07:00
basics_test.cc Replace mt19937 with new generator to enable parallel sampling 2025-09-04 23:49:10 -07:00
mat.cc Add 8-bit integer quantization (I8Stream) to Gemma.cpp. 2025-10-15 09:25:20 -07:00
mat.h [Gemma.cpp] Allows non-owned arguments for attention methods. 2025-10-27 10:43:25 -07:00
test_util.h Minor cleanup/fixes: 2024-09-09 06:58:09 -07:00
threading.cc Major cleanup of profiler zones, add Caller annotation for all pool.Run 2025-10-23 01:54:24 -07:00
threading.h 1.01x speedup: improved autotune 2025-10-27 05:35:31 -07:00
threading_context.cc Major cleanup of profiler zones, add Caller annotation for all pool.Run 2025-10-23 01:54:24 -07:00
threading_context.h 1.02x speedup: improve load balance and simplify parallelFor 2025-10-24 00:19:09 -07:00
threading_test.cc 1.02x speedup: improve load balance and simplify parallelFor 2025-10-24 00:19:09 -07:00
topology.cc Remove multi-package support from topology 2025-10-16 04:00:35 -07:00
topology.h Remove multi-package support from topology 2025-10-16 04:00:35 -07:00
zones.cc Major cleanup of profiler zones, add Caller annotation for all pool.Run 2025-10-23 01:54:24 -07:00
zones.h Major cleanup of profiler zones, add Caller annotation for all pool.Run 2025-10-23 01:54:24 -07:00