| Name | Last commit | Date |
|---|---|---|
| python | Allow conversion, loading and inference with NUQ. | 2025-02-05 07:45:54 -08:00 |
| BUILD.bazel | Improved blob diff: parallel, tolerance for float | 2025-02-06 13:46:28 -08:00 |
| analyze.h | Major compression update, arbitrary-len unpack + new Dot | 2024-09-10 02:22:19 -07:00 |
| blob_compare.cc | Improved blob diff: parallel, tolerance for float | 2025-02-06 13:46:28 -08:00 |
| blob_store.cc | Expose BlobReader::Keys() | 2024-11-07 10:28:39 -08:00 |
| blob_store.h | Added the TensorInfo arg to the compressor so the shape and scale can be output correctly to the file in future. | 2024-12-11 01:26:35 -08:00 |
| blob_store_test.cc | Expose BlobReader::Keys() | 2024-11-07 10:28:39 -08:00 |
| compress-inl.h | Infra improvements (2) | 2025-01-23 01:55:19 -08:00 |
| compress.cc | Added MatPtr/MatPtrT/MatStorageT/MatStorage as a dynamically-sized replacement for CompressedArray. | 2024-10-10 08:22:30 -07:00 |
| compress.h | Allow conversion, loading and inference with NUQ. | 2025-02-05 07:45:54 -08:00 |
| compress_test.cc | Allow conversion, loading and inference with NUQ. | 2025-02-05 07:45:54 -08:00 |
| compress_weights.cc | Added ability to load/save a complete model file, including tokenizer. | 2024-12-19 07:59:41 -08:00 |
| convert_weights.py | Cleanup: move util/compress and convert_weights to compression/ | 2024-07-05 04:16:52 -07:00 |
| distortion.h | Refactor/cleanup, remove even_odd | 2024-09-04 09:25:13 -07:00 |
| distortion_test.cc | Major compression update, arbitrary-len unpack + new Dot | 2024-09-10 02:22:19 -07:00 |
| fields.cc | Added ability to load/save a complete model file, including tokenizer. | 2024-12-19 07:59:41 -08:00 |
| fields.h | Added ability to load/save a complete model file, including tokenizer. | 2024-12-19 07:59:41 -08:00 |
| fields_test.cc | Added ability to load/save a complete model file, including tokenizer. | 2024-12-19 07:59:41 -08:00 |
| io.cc | Further improve IO, enable multiple backends without -D. | 2024-04-19 00:40:29 -07:00 |
| io.h | Major duplicated code reduction in test/benchmarks | 2024-06-14 00:16:25 -07:00 |
| io_win.cc | Further improve IO, enable multiple backends without -D. | 2024-04-19 00:40:29 -07:00 |
| migrate_weights.cc | Allow interactive use with new single-file weight format. | 2025-01-15 07:22:33 -08:00 |
| nuq-inl.h | Base interleaved handling for 4.5-bit NUQ, specifically Enc, DecompressAndZeroPad, and Dec2. Includes tests. | 2025-01-31 10:35:32 -08:00 |
| nuq_test.cc | Base interleaved handling for 4.5-bit NUQ, specifically Enc, DecompressAndZeroPad, and Dec2. Includes tests. | 2025-01-31 10:35:32 -08:00 |
| sfp-inl.h | Major compression update, arbitrary-len unpack + new Dot | 2024-09-10 02:22:19 -07:00 |
| sfp_test.cc | Major compression update, arbitrary-len unpack + new Dot | 2024-09-10 02:22:19 -07:00 |
| shared.h | Allow conversion, loading and inference with NUQ. | 2025-02-05 07:45:54 -08:00 |
| test_util-inl.h | Add double-precision dot variant | 2024-09-26 12:09:10 -07:00 |