gemma.cpp/gemma
Ray Smith e69bc3bc1c Added the TensorInfo arg to the compressor so the shape and scale can be output correctly to the file in future.
Corrected some errors in the TensorIndex.

PiperOrigin-RevId: 705014619
2024-12-11 01:26:35 -08:00
..
evals Add MMLU eval to github 2024-05-20 10:20:53 -07:00
instantiations Eliminated TConfig. 2024-10-17 05:04:22 -07:00
activations.h Simpler MatMul interface, vocab types, Tristate for use_spinning 2024-11-04 07:48:29 -08:00
common.cc Merge pull request #460 from ericcurtin:common 2024-12-10 06:33:37 -08:00
common.h Fix PaliGemma's GenerateImageTokensT(). 2024-10-18 01:34:13 -07:00
configs.cc Add support for 448px resolution to PaliGemma and PaliGemma2. 2024-12-09 11:38:10 -08:00
configs.h Add support for 448px resolution to PaliGemma and PaliGemma2. 2024-12-09 11:38:10 -08:00
configs_test.cc Improved consistency of compressor API, and added a universal method with a target type arg. 2024-11-21 05:27:40 -08:00
gemma-inl.h Make top_k a runtime argument (instead of a model argument). 2024-11-13 09:48:59 -08:00
gemma.cc Simpler MatMul interface, vocab types, Tristate for use_spinning 2024-11-04 07:48:29 -08:00
gemma.h Make top_k a runtime argument (instead of a model argument). 2024-11-13 09:48:59 -08:00
kv_cache.cc Fix Griffin model: 2024-11-08 08:30:53 -08:00
kv_cache.h Fix Griffin model: 2024-11-08 08:30:53 -08:00
run.cc Add support for 448px resolution to PaliGemma and PaliGemma2. 2024-12-09 11:38:10 -08:00
tensor_index.cc Added the TensorInfo arg to the compressor so the shape and scale can be output correctly to the file in future. 2024-12-11 01:26:35 -08:00
tensor_index.h Added the TensorInfo arg to the compressor so the shape and scale can be output correctly to the file in future. 2024-12-11 01:26:35 -08:00
tensor_index_test.cc Added tensor_index as a single source of truth on tensor shapes/sources and transformations 2024-11-19 00:25:39 -08:00
tokenizer.cc Factor out addition of ViTConfig to a ModelConfig. 2024-10-28 05:29:33 -07:00
tokenizer.h 7x compile time speedup: shard gemma.cc 2024-07-03 06:35:04 -07:00
weights.cc Use NestedPools, add NUMA infra 2024-10-18 08:11:18 -07:00
weights.h Add support for 448px resolution to PaliGemma and PaliGemma2. 2024-12-09 11:38:10 -08:00