gemma.cpp/compression
Jan Wassenberg 45ad847a41 Replace RowVectorBatch with MatStorageT
KVCache: add ctor required for MatStorageT, remove Create; bf_pre_ffw_rms_out -> pre_ffw_rms_out
optimize_test: larger vocab_size requires more steps
shared.h: Remove unused u128 type
correctly set Activation matrix rows, avoid passing as arg
ops: pass Mat instead of pointers/sizes; vectorize LayerNorm; support any weight type
mat: add OverrideRows, used by SetBatchSize
PiperOrigin-RevId: 757790736
2025-05-12 09:16:12 -07:00
..
python Move fields, io* and blob* from compression/ into io/ 2025-05-06 11:17:19 -07:00
BUILD.bazel Move fields, io* and blob* from compression/ into io/ 2025-05-06 11:17:19 -07:00
analyze.h Major compression update, arbitrary-len unpack + new Dot 2024-09-10 02:22:19 -07:00
compress-inl.h Huge refactor of weight handling and model loading. 2025-05-06 04:44:21 -07:00
compress.cc Minor cleanup, on-demand NUQ buffer allocation 2025-04-16 10:49:43 -07:00
compress.h Huge refactor of weight handling and model loading. 2025-05-06 04:44:21 -07:00
compress_test.cc Huge refactor of weight handling and model loading. 2025-05-06 04:44:21 -07:00
distortion.h Refactor/cleanup, remove even_odd 2024-09-04 09:25:13 -07:00
distortion_test.cc Major compression update, arbitrary-len unpack + new Dot 2024-09-10 02:22:19 -07:00
nuq-inl.h Fix nuq Enc() to handle groups < kGroupSize. 2025-02-10 07:17:59 -08:00
nuq_test.cc Replace RowVectorBatch with MatStorageT 2025-05-12 09:16:12 -07:00
sfp-inl.h Major compression update, arbitrary-len unpack + new Dot 2024-09-10 02:22:19 -07:00
sfp_test.cc Major compression update, arbitrary-len unpack + new Dot 2024-09-10 02:22:19 -07:00
shared.h Replace RowVectorBatch with MatStorageT 2025-05-12 09:16:12 -07:00
test_util-inl.h Major refactor of allocator/args: 2025-04-10 01:29:54 -07:00