gemma.cpp/compression
Jan Wassenberg bd53b0f7c3 Fix MSAN issue for multiturn. Rewind the prior EOS token.
Also move MaybeCheckInitialized to allocator.h

PiperOrigin-RevId: 683187458
2024-10-07 08:07:54 -07:00
python Adds insert_float() to SbsWriter() to store a float array directly. 2024-09-12 13:27:24 -07:00
BUILD.bazel Fix MSAN issue for multiturn. Rewind the prior EOS token. 2024-10-07 08:07:54 -07:00
analyze.h Major compression update, arbitrary-len unpack + new Dot 2024-09-10 02:22:19 -07:00
blob_store.cc Fix mismatch between blob_store and compress interfaces (bytes) 2024-09-10 10:59:17 -07:00
blob_store.h Fix mismatch between blob_store and compress interfaces (bytes) 2024-09-10 10:59:17 -07:00
compress-inl.h Also enable f64 dot/sum for <f32 inputs 2024-10-04 07:12:10 -07:00
compress.h Fix mismatch between blob_store and compress interfaces (bytes) 2024-09-10 10:59:17 -07:00
compress_test.cc Major compression update, arbitrary-len unpack + new Dot 2024-09-10 02:22:19 -07:00
compress_weights.cc Add support for PaliGemma Vision-LM (224x224) to gemma.cpp 2024-09-23 10:09:38 -07:00
convert_weights.py Cleanup: move util/compress and convert_weights to compression/ 2024-07-05 04:16:52 -07:00
distortion.h Refactor/cleanup, remove even_odd 2024-09-04 09:25:13 -07:00
distortion_test.cc Major compression update, arbitrary-len unpack + new Dot 2024-09-10 02:22:19 -07:00
io.cc Further improve IO, enable multiple backends without -D. 2024-04-19 00:40:29 -07:00
io.h Major duplicated code reduction in test/benchmarks 2024-06-14 00:16:25 -07:00
io_win.cc Further improve IO, enable multiple backends without -D. 2024-04-19 00:40:29 -07:00
nuq-inl.h Fix MSAN issue for multiturn. Rewind the prior EOS token. 2024-10-07 08:07:54 -07:00
nuq_test.cc Major compression update, arbitrary-len unpack + new Dot 2024-09-10 02:22:19 -07:00
sfp-inl.h Major compression update, arbitrary-len unpack + new Dot 2024-09-10 02:22:19 -07:00
sfp_test.cc Major compression update, arbitrary-len unpack + new Dot 2024-09-10 02:22:19 -07:00
shared.h Major compression update, arbitrary-len unpack + new Dot 2024-09-10 02:22:19 -07:00
test_util-inl.h Add double-precision dot variant 2024-09-26 12:09:10 -07:00
weights_raw.h Major MatMul update, 1.9-2.3x speedup on Zen4 via bf16 mul 2024-08-16 07:52:20 -07:00