gemma.cpp/compression
Jan Wassenberg 07c34cb18a Further nuq_test speedups to prevent timeout
PiperOrigin-RevId: 670863385
2024-09-04 00:49:44 -07:00
..
python Add Python code for converting Griffin Orbax weights. Refs #301 2024-07-29 12:53:30 -07:00
BUILD Further nuq_test speedups to prevent timeout 2024-09-04 00:49:44 -07:00
analyze.h Remove no longer required stats.h - use Highway version instead 2024-06-05 01:37:48 -07:00
blob_store.cc Make BlobWriter::Add() accept const void* 2024-05-17 08:11:06 -07:00
blob_store.h Make BlobWriter::Add() accept const void* 2024-05-17 08:11:06 -07:00
compress-inl.h Avoid duplication of RMSNorm, support all activation/weight types 2024-08-28 01:26:55 -07:00
compress.h 0.98x prefill: refactor in prep for cache blocking. 2024-08-12 09:26:29 -07:00
compress_weights.cc Refactor configurables. 2024-07-10 21:30:58 -07:00
convert_weights.py Cleanup: move util/compress and convert_weights to compression/ 2024-07-05 04:16:52 -07:00
distortion.h Experiment with compensated dot product. 2024-08-27 12:05:35 -07:00
distortion_test.cc Avoid duplication of RMSNorm, support all activation/weight types 2024-08-28 01:26:55 -07:00
io.cc Further improve IO, enable multiple backends without -D. 2024-04-19 00:40:29 -07:00
io.h Major duplicated code reduction in test/benchmarks 2024-06-14 00:16:25 -07:00
io_win.cc Further improve IO, enable multiple backends without -D. 2024-04-19 00:40:29 -07:00
nuq-inl.h Fix NUQ for SVE - incorrect nibble packing 2024-09-03 10:59:01 -07:00
nuq.h initial commit 2024-02-21 03:31:22 +00:00
nuq_test.cc Further nuq_test speedups to prevent timeout 2024-09-04 00:49:44 -07:00
sfp-inl.h Experiment with compensated dot product. 2024-08-27 12:05:35 -07:00
sfp.h initial commit 2024-02-21 03:31:22 +00:00
sfp_test.cc Avoid duplication of RMSNorm, support all activation/weight types 2024-08-28 01:26:55 -07:00
weights_raw.h Major MatMul update, 1.9-2.3x speedup on Zen4 via bf16 mul 2024-08-16 07:52:20 -07:00