gemma.cpp

History

Jan Wassenberg 8d0882b966 Huge refactor of weight handling and model loading. Weight handling: - new ModelStore2 supports both pre-2025 multi-file and single-file formats - simpler ForEachTensor with TensorArgs - tensors are constructed with their full suffixed name I/O: - support mmap and stride - Simplified SbsWriter, single insert(); add SbsReader Misc: - kMockTokenizer: allow creating with unavailable tokenizer - configs.h: Simpler enum validity checks via kSentinel - matmul.h: remove unused enable_bind (now in allocator.h) - tensor_info: single TensorInfoRegistry class, rename from tensor_index.h Frontends: - Replace Allocate/CreateGemma with ctor(LoaderArgs, MatMulEnv&) - Deduce model/weight type, remove --model and parsing - Replace most common.h includes with configs.h - Remove --compressed_weights, use --weights instead - Remove ModelInfo, replaced by ModelConfig. Backprop: - Reduce max loss, remove backward_scalar_test (timeout) - Update thresholds because new RandInit changes rng eval order and thus numerics PiperOrigin-RevId: 755317484		2025-05-06 04:44:21 -07:00
..
bench_matmul.cc	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
dot-inl.h	Huge refactor of weight handling and model loading.	2025-05-06 04:44:21 -07:00
dot_test.cc	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
fp_arith-inl.h	Cascaded summation for Softmax	2024-09-20 10:31:23 -07:00
gemma_matvec_test.cc	Huge refactor of weight handling and model loading.	2025-05-06 04:44:21 -07:00
matmul-inl.h	Huge refactor of weight handling and model loading.	2025-05-06 04:44:21 -07:00
matmul.cc	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
matmul.h	Huge refactor of weight handling and model loading.	2025-05-06 04:44:21 -07:00
matmul_test.cc	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
matvec-inl.h	Huge refactor of weight handling and model loading.	2025-05-06 04:44:21 -07:00
ops-inl.h	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
ops.h	Major refactor of allocator/args:	2025-04-10 01:29:54 -07:00
ops_test.cc	Huge refactor of weight handling and model loading.	2025-05-06 04:44:21 -07:00
sum-inl.h	Minor cleanup, Windows+Bazel build fixes	2024-10-10 09:05:06 -07:00