gemma.cpp/compression
Jan Wassenberg 4f9155d8c6 Add bf16 matmul support, update naming+test
Avoid int32, which can easily overflow for large matrices.
Also fix IDE warning in sfp-inl.

PiperOrigin-RevId: 640149845
2024-06-04 07:41:46 -07:00
..
BUILD Fix SVE build: add missing hn:: 2024-05-10 06:49:26 -07:00
analyze.h Move code to gemma/ so we can remove error-prone copybara: comments. 2024-04-09 04:45:42 -07:00
blob_store.cc Make BlobWriter::Add() accept const void* 2024-05-17 08:11:06 -07:00
blob_store.h Make BlobWriter::Add() accept const void* 2024-05-17 08:11:06 -07:00
compress-inl.h Add first version of backpropagation support. 2024-06-04 08:37:49 +00:00
compress.h Minor internal refactoring. 2024-05-21 10:29:59 -07:00
distortion.h Update to Highway 1.2 for topology/VQSelect 2024-05-31 12:29:10 -07:00
distortion_test.cc Update distortion.h to weighted average, add distortion_test. 2024-04-17 01:44:19 -07:00
io.cc Further improve IO, enable multiple backends without -D. 2024-04-19 00:40:29 -07:00
io.h Further improve IO, enable multiple backends without -D. 2024-04-19 00:40:29 -07:00
io_win.cc Further improve IO, enable multiple backends without -D. 2024-04-19 00:40:29 -07:00
nuq-inl.h Fix underflow in NUQ ClusterCost() 2024-04-25 11:28:51 -07:00
nuq.h initial commit 2024-02-21 03:31:22 +00:00
nuq_test.cc Update distortion.h to weighted average, add distortion_test. 2024-04-17 01:44:19 -07:00
sfp-inl.h Add bf16 matmul support, update naming+test 2024-06-04 07:41:46 -07:00
sfp.h initial commit 2024-02-21 03:31:22 +00:00
sfp_test.cc Enable even/odd for SFP. Refs #166 2024-05-08 07:09:06 -07:00
stats.cc Move code to gemma/ so we can remove error-prone copybara: comments. 2024-04-09 04:45:42 -07:00
stats.h Further improve IO, enable multiple backends without -D. 2024-04-19 00:40:29 -07:00
test_util.h Update distortion.h to weighted average, add distortion_test. 2024-04-17 01:44:19 -07:00