evals
Add MMLU eval to github
2024-05-20 10:20:53 -07:00
instantiations
Eliminated TConfig.
2024-10-17 05:04:22 -07:00
activations.h
Infra improvements (2)
2025-01-23 01:55:19 -08:00
common.cc
Rename ModelTraining to PromptWrapping which is a more accurate name.
2024-12-13 07:45:59 -08:00
common.h
Rename ModelTraining to PromptWrapping which is a more accurate name.
2024-12-13 07:45:59 -08:00
configs.cc
Windows build fixes: struct vs class, unused arg/var, avoid VLA, Deleter arg, casts
2025-02-07 07:38:55 -08:00
configs.h
Windows build fixes: struct vs class, unused arg/var, avoid VLA, Deleter arg, casts
2025-02-07 07:38:55 -08:00
configs_test.cc
Moved the vit config fields to their own config struct
2025-01-15 01:09:49 -08:00
gemma-inl.h
Factor out DecodeStepT from GenerateT into a separate function.
2025-02-10 03:53:08 -08:00
gemma.cc
Infra improvements (2)
2025-01-23 01:55:19 -08:00
gemma.h
Infra improvements (2)
2025-01-23 01:55:19 -08:00
kv_cache.cc
Added ability to load/save a complete model file, including tokenizer.
2024-12-19 07:59:41 -08:00
kv_cache.h
Fix Griffin model:
2024-11-08 08:30:53 -08:00
run.cc
Moved the vit config fields to their own config struct
2025-01-15 01:09:49 -08:00
tensor_index.cc
Moved the vit config fields to their own config struct
2025-01-15 01:09:49 -08:00
tensor_index.h
Added the TensorInfo arg to the compressor so the shape and scale can be output correctly to the file in future.
2024-12-11 01:26:35 -08:00
tensor_index_test.cc
Moved the vit config fields to their own config struct
2025-01-15 01:09:49 -08:00
tokenizer.cc
Added ability to load/save a complete model file, including tokenizer.
2024-12-19 07:59:41 -08:00
tokenizer.h
Added ability to load/save a complete model file, including tokenizer.
2024-12-19 07:59:41 -08:00
weights.cc
Allow conversion, loading and inference with NUQ.
2025-02-05 07:45:54 -08:00
weights.h
Windows build fixes: struct vs class, unused arg/var, avoid VLA, Deleter arg, casts
2025-02-07 07:38:55 -08:00