| python | Add Python code for converting Griffin Orbax weights. Refs #301 | 2024-07-29 12:53:30 -07:00 |
| BUILD | Further nuq_test speedups to prevent timeout | 2024-09-04 00:49:44 -07:00 |
| analyze.h | Remove no longer required stats.h - use Highway version instead | 2024-06-05 01:37:48 -07:00 |
| blob_store.cc | Make BlobWriter::Add() accept const void* | 2024-05-17 08:11:06 -07:00 |
| blob_store.h | Make BlobWriter::Add() accept const void* | 2024-05-17 08:11:06 -07:00 |
| compress-inl.h | Avoid duplication of RMSNorm, support all activation/weight types | 2024-08-28 01:26:55 -07:00 |
| compress.h | 0.98x prefill: refactor in prep for cache blocking. | 2024-08-12 09:26:29 -07:00 |
| compress_weights.cc | Refactor configurables. | 2024-07-10 21:30:58 -07:00 |
| convert_weights.py | Cleanup: move util/compress and convert_weights to compression/ | 2024-07-05 04:16:52 -07:00 |
| distortion.h | Experiment with compensated dot product. | 2024-08-27 12:05:35 -07:00 |
| distortion_test.cc | Avoid duplication of RMSNorm, support all activation/weight types | 2024-08-28 01:26:55 -07:00 |
| io.cc | Further improve IO, enable multiple backends without -D. | 2024-04-19 00:40:29 -07:00 |
| io.h | Major duplicated code reduction in test/benchmarks | 2024-06-14 00:16:25 -07:00 |
| io_win.cc | Further improve IO, enable multiple backends without -D. | 2024-04-19 00:40:29 -07:00 |
| nuq-inl.h | Fix NUQ for SVE - incorrect nibble packing | 2024-09-03 10:59:01 -07:00 |
| nuq.h | initial commit | 2024-02-21 03:31:22 +00:00 |
| nuq_test.cc | Further nuq_test speedups to prevent timeout | 2024-09-04 00:49:44 -07:00 |
| sfp-inl.h | Experiment with compensated dot product. | 2024-08-27 12:05:35 -07:00 |
| sfp.h | initial commit | 2024-02-21 03:31:22 +00:00 |
| sfp_test.cc | Avoid duplication of RMSNorm, support all activation/weight types | 2024-08-28 01:26:55 -07:00 |
| weights_raw.h | Major MatMul update, 1.9-2.3x speedup on Zen4 via bf16 mul | 2024-08-16 07:52:20 -07:00 |