gemma.cpp/gemma
Ray Smith 0d68555f87 Eliminated TConfig.
Changed CompressedLayer and CompressedWeights to be constructed with an instance of a LayerConfig and WeightsConfig respectively.
Added CompressedModel to remove ByteStorageT and get rid of most of the type casting, as well as allowing the default destructor to be used and work properly.
Adjusted WeightsWrapper and ForwardLayer etc to match.
The only remaining template arg is the weight type.
This enables all the instantiations to be deleted, apart from one per type.
It also enables (but not yet done) the config to be stored in the blob file instead of having to be specified separately.
Reduces the size of the gemma_lib and weights shared libraries by a factor of 4.3 and 3.2 respectively.

PiperOrigin-RevId: 686870060
2024-10-17 05:04:22 -07:00
..
evals Add MMLU eval to github 2024-05-20 10:20:53 -07:00
instantiations Eliminated TConfig. 2024-10-17 05:04:22 -07:00
activations.h Eliminated TConfig. 2024-10-17 05:04:22 -07:00
common.cc Eliminated TConfig. 2024-10-17 05:04:22 -07:00
common.h Eliminated TConfig. 2024-10-17 05:04:22 -07:00
configs.cc Eliminated TConfig. 2024-10-17 05:04:22 -07:00
configs.h Eliminated TConfig. 2024-10-17 05:04:22 -07:00
configs_test.cc Eliminated TConfig. 2024-10-17 05:04:22 -07:00
gemma-inl.h Eliminated TConfig. 2024-10-17 05:04:22 -07:00
gemma.cc Eliminated TConfig. 2024-10-17 05:04:22 -07:00
gemma.h Eliminated TConfig. 2024-10-17 05:04:22 -07:00
kv_cache.cc Eliminated TConfig. 2024-10-17 05:04:22 -07:00
kv_cache.h Eliminated TConfig. 2024-10-17 05:04:22 -07:00
run.cc Eliminated TConfig. 2024-10-17 05:04:22 -07:00
tokenizer.cc Add support for PaliGemma Vision-LM (224x224) to gemma.cpp 2024-09-23 10:09:38 -07:00
tokenizer.h 7x compile time speedup: shard gemma.cc 2024-07-03 06:35:04 -07:00
weights.cc Eliminated TConfig. 2024-10-17 05:04:22 -07:00
weights.h Eliminated TConfig. 2024-10-17 05:04:22 -07:00