gemma.cpp/ops
Jan Wassenberg 8532da47f7 Major refactor of allocator/args:
use new ThreadingContext2 instead of monostate/init in each frontend
Add ThreadingArgs (replaces AppArgs)

backprop: use Packed() accessor and MakePacked factory and row-based access to allow for stride
compress_weights: remove, moving to py-only exporter instead

Move MatPtr to mat.h and revise interface:
- Generic MatOwner
- rename accessors to Packed*
- support stride/row accessors, fix RowPtr stride

Add TypeBits(Type)
Move GenerateMat to test_util-inl for sharing between matmul test/bench
Move internal init to gemma.cc to avoid duplication
Rename GemmaEnv model_ to gemma_ for disambiguating vs upcoming ModelStorage
Remove --compressed_weights, use --weights instead.
tensor_index: add ExtentsFromInfo and TensorIndexLLM/Img
Allocator: use normal unique_ptr for AllocBytes so users can call directly
threading: use -> because AlignedPtr no longer assumes arrays
PiperOrigin-RevId: 745918637
2025-04-10 01:29:54 -07:00
bench_matmul.cc Major refactor of allocator/args: 2025-04-10 01:29:54 -07:00
dot-inl.h Major refactor of allocator/args: 2025-04-10 01:29:54 -07:00
dot_test.cc Major refactor of allocator/args: 2025-04-10 01:29:54 -07:00
fp_arith-inl.h Cascaded summation for Softmax 2024-09-20 10:31:23 -07:00
gemma_matvec_test.cc Major refactor of allocator/args: 2025-04-10 01:29:54 -07:00
matmul-inl.h Major refactor of allocator/args: 2025-04-10 01:29:54 -07:00
matmul.cc Major refactor of allocator/args: 2025-04-10 01:29:54 -07:00
matmul.h Major refactor of allocator/args: 2025-04-10 01:29:54 -07:00
matmul_test.cc Major refactor of allocator/args: 2025-04-10 01:29:54 -07:00
matvec-inl.h Major refactor of allocator/args: 2025-04-10 01:29:54 -07:00
ops-inl.h Major refactor of allocator/args: 2025-04-10 01:29:54 -07:00
ops.h Major refactor of allocator/args: 2025-04-10 01:29:54 -07:00
ops_test.cc Major refactor of allocator/args: 2025-04-10 01:29:54 -07:00
sum-inl.h Minor cleanup, Windows+Bazel build fixes 2024-10-10 09:05:06 -07:00