| .. |
|
evals
|
Add MMLU eval to github
|
2024-05-20 10:20:53 -07:00 |
|
instantiations
|
Eliminated TConfig.
|
2024-10-17 05:04:22 -07:00 |
|
activations.h
|
Infra improvements (2)
|
2025-01-23 01:55:19 -08:00 |
|
common.cc
|
Rename ModelTraining to PromptWrapping which is a more accurate name.
|
2024-12-13 07:45:59 -08:00 |
|
common.h
|
Rename ModelTraining to PromptWrapping which is a more accurate name.
|
2024-12-13 07:45:59 -08:00 |
|
configs.cc
|
Windows build fixes: struct vs class, unused arg/var, avoid VLA, Deleter arg, casts
|
2025-02-07 07:38:55 -08:00 |
|
configs.h
|
Windows build fixes: struct vs class, unused arg/var, avoid VLA, Deleter arg, casts
|
2025-02-07 07:38:55 -08:00 |
|
configs_test.cc
|
Moved the vit config fields to their own config struct
|
2025-01-15 01:09:49 -08:00 |
|
gemma-inl.h
|
Implements FusedSoftmaxAndSampleTopK.
|
2025-02-16 21:30:06 -08:00 |
|
gemma.cc
|
Infra improvements (2)
|
2025-01-23 01:55:19 -08:00 |
|
gemma.h
|
Matmul rewrite: fp64 sums, hierarchical parallelization, cache-blocking, autotuning
|
2025-02-20 08:33:46 -08:00 |
|
kv_cache.cc
|
Add KVCache.DeepCopy() . Will be useful for implementing sampling functionality like beam sampling, parallel sampling, CoT Decoding (à la https://arxiv.org/abs/2402.10200)
|
2025-02-10 04:10:29 -08:00 |
|
kv_cache.h
|
Add KVCache.DeepCopy() . Will be useful for implementing sampling functionality like beam sampling, parallel sampling, CoT Decoding (à la https://arxiv.org/abs/2402.10200)
|
2025-02-10 04:10:29 -08:00 |
|
run.cc
|
Matmul rewrite: fp64 sums, hierarchical parallelization, cache-blocking, autotuning
|
2025-02-20 08:33:46 -08:00 |
|
tensor_index.cc
|
Moved the vit config fields to their own config struct
|
2025-01-15 01:09:49 -08:00 |
|
tensor_index.h
|
Added the TensorInfo arg to the compressor so the shape and scale can be output correctly to the file in future.
|
2024-12-11 01:26:35 -08:00 |
|
tensor_index_test.cc
|
Moved the vit config fields to their own config struct
|
2025-01-15 01:09:49 -08:00 |
|
tokenizer.cc
|
Added ability to load/save a complete model file, including tokenizer.
|
2024-12-19 07:59:41 -08:00 |
|
tokenizer.h
|
Added ability to load/save a complete model file, including tokenizer.
|
2024-12-19 07:59:41 -08:00 |
|
weights.cc
|
Fix the link error when building `compress_weights` with Clang on macOS
|
2025-02-09 00:13:25 +08:00 |
|
weights.h
|
Fix the link error when building `compress_weights` with Clang on macOS
|
2025-02-09 00:13:25 +08:00 |