gemma.cpp

History

Jan Wassenberg 8532da47f7 Major refactor of allocator/args: use new ThreadingContext2 instead of monostate/init in each frontend Add ThreadingArgs(replaces AppArgs) backprop: use Packed() accessor and MakePacked factory and row-based access to allow for stride compress_weights: remove, moving to py-only exporter instead Move MatPtr to mat.h and revise interface: - Generic MatOwner - rename accessors to Packed* - support stride/row accessors, fix RowPtr stride Add TypeBits(Type) Move GenerateMat to test_util-inl for sharing between matmul test/bench Move internal init to gemma.cc to avoid duplication Rename GemmaEnv model_ to gemma_ for disambiguating vs upcoming ModelStorage Remove --compressed_weights, use --weights instead. tensor_index: add ExtentsFromInfo and TensorIndexLLM/Img Allocator: use normal unique_ptr for AllocBytes so users can call directly threading: use -> because AlignedPtr no longer assumes arrays PiperOrigin-RevId: 745918637		2025-04-10 01:29:54 -07:00
..
allocator.cc	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
allocator.h	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
args.h	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
basics.h	Minor cleanup: enable 0,0 Extents2D, add SerializedSpan typedef, include fixes	2025-04-08 03:35:55 -07:00
mat.cc	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
mat.h	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
test_util.h	Minor cleanup/fixes:	2024-09-09 06:58:09 -07:00
threading.cc	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
threading.h	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
threading_context.cc	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
threading_context.h	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
threading_test.cc	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
topology.cc	Add new singleton Allocator2 instead of monostate	2025-04-08 09:00:59 -07:00
topology.h	Refactor Gemma ctor and improve pool NUMA support	2025-03-14 10:19:00 -07:00