gemma.cpp/io
Jan Wassenberg cf4d7ceb82 1.16x decode speedup: remove last MatVec in Attention
Precompute row pointers.
Remove no longer used MHA support; QStride -> qkv_dim.
Remove RowPtr from MatMul interface, use only MatPtrT.
Require opt-in define for NUQ to speed up builds.
Also fix io.cc on Windows.

PiperOrigin-RevId: 766228108
2025-06-02 09:40:29 -07:00
BUILD.bazel Move fields, io* and blob* from compression/ into io/ 2025-05-06 11:17:19 -07:00
blob_compare.cc 3.8x speedup of weights loading via preadv on Linux 2025-05-15 11:55:15 -07:00
blob_store.cc Fix RowT issue and improve Griffin (currently still broken) 2025-05-19 07:02:10 -07:00
blob_store.h Fix RowT issue and improve Griffin (currently still broken) 2025-05-19 07:02:10 -07:00
blob_store_test.cc 3.8x speedup of weights loading via preadv on Linux 2025-05-15 11:55:15 -07:00
fields.cc Move fields, io* and blob* from compression/ into io/ 2025-05-06 11:17:19 -07:00
fields.h Move fields, io* and blob* from compression/ into io/ 2025-05-06 11:17:19 -07:00
fields_test.cc Move fields, io* and blob* from compression/ into io/ 2025-05-06 11:17:19 -07:00
io.cc 1.16x decode speedup: remove last MatVec in Attention 2025-06-02 09:40:29 -07:00
io.h 3.8x speedup of weights loading via preadv on Linux 2025-05-15 11:55:15 -07:00
io_win.cc Cleanup: remove unused kCyclic, remove 2 suffix 2025-05-13 01:06:41 -07:00
migrate_weights.cc Move fields, io* and blob* from compression/ into io/ 2025-05-06 11:17:19 -07:00