gemma.cpp

Jan Wassenberg 8532da47f7	2025-04-10 01:29:54 -07:00

Major refactor of allocator/args: use new ThreadingContext2 instead of monostate/init in each frontend

Add ThreadingArgs(replaces AppArgs)
backprop: use Packed() accessor and MakePacked factory and row-based access to allow for stride
compress_weights: remove, moving to py-only exporter instead
Move MatPtr to mat.h and revise interface:
- Generic MatOwner
- rename accessors to Packed*
- support stride/row accessors, fix RowPtr stride
Add TypeBits(Type)
Move GenerateMat to test_util-inl for sharing between matmul test/bench
Move internal init to gemma.cc to avoid duplication
Rename GemmaEnv model_ to gemma_ for disambiguating vs upcoming ModelStorage
Remove --compressed_weights, use --weights instead.
tensor_index: add ExtentsFromInfo and TensorIndexLLM/Img
Allocator: use normal unique_ptr for AllocBytes so users can call directly
threading: use -> because AlignedPtr no longer assumes arrays

PiperOrigin-RevId: 745918637

evals	Add MMLU eval to github	2024-05-20 10:20:53 -07:00
instantiations	Eliminated TConfig.	2024-10-17 05:04:22 -07:00
activations.h	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
common.cc	Merge pull request #529 from ufownl:refactor/wrap_and_tokenize	2025-04-08 09:22:26 -07:00
common.h	Merge pull request #529 from ufownl:refactor/wrap_and_tokenize	2025-04-08 09:22:26 -07:00
configs.cc	Set the secondary EOS for Gemma2	2025-03-22 01:32:22 +08:00
configs.h	Add support for a secondary EOS token	2025-03-20 12:28:31 -07:00
configs_test.cc	Moved the vit config fields to their own config struct	2025-01-15 01:09:49 -08:00
gemma-inl.h	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
gemma.cc	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
gemma.h	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
gemma_args.h	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
kv_cache.cc	Add KVCache.DeepCopy(). Will be useful for implementing sampling functionality like beam sampling, parallel sampling, CoT Decoding (à la https://arxiv.org/abs/2402.10200)	2025-02-10 04:10:29 -08:00
kv_cache.h	Add KVCache.DeepCopy(). Will be useful for implementing sampling functionality like beam sampling, parallel sampling, CoT Decoding (à la https://arxiv.org/abs/2402.10200)	2025-02-10 04:10:29 -08:00
run.cc	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
tensor_index.cc	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
tensor_index.h	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
tensor_index_test.cc	Moved the vit config fields to their own config struct	2025-01-15 01:09:49 -08:00
tokenizer.cc	Merge pull request #529 from ufownl:refactor/wrap_and_tokenize	2025-04-08 09:22:26 -07:00
tokenizer.h	Merge pull request #529 from ufownl:refactor/wrap_and_tokenize	2025-04-08 09:22:26 -07:00
weights.cc	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
weights.h	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00