gemma.cpp/gemma
Jan Wassenberg 45ad847a41 Replace RowVectorBatch with MatStorageT
KVCache: add ctor required for MatStorageT, remove Create; bf_pre_ffw_rms_out -> pre_ffw_rms_out
optimize_test: larger vocab_size requires more steps
shared.h: Remove unused u128 type
correctly set Activation matrix rows, avoid passing as arg
ops: pass Mat instead of pointers/sizes; vectorize LayerNorm; support any weight type
mat: add OverrideRows, used by SetBatchSize
PiperOrigin-RevId: 757790736
2025-05-12 09:16:12 -07:00
bindings Replace RowVectorBatch with MatStorageT 2025-05-12 09:16:12 -07:00
evals Add MMLU eval to github 2024-05-20 10:20:53 -07:00
instantiations Eliminated TConfig. 2024-10-17 05:04:22 -07:00
activations.h Replace RowVectorBatch with MatStorageT 2025-05-12 09:16:12 -07:00
common.cc Huge refactor of weight handling and model loading. 2025-05-06 04:44:21 -07:00
common.h Huge refactor of weight handling and model loading. 2025-05-06 04:44:21 -07:00
configs.cc Remove support for Gemma 1 and PaliGemma 1 models, superseded by (Pali)Gemma 2. 2025-05-09 02:17:27 -07:00
configs.h Remove support for Gemma 1 and PaliGemma 1 models, superseded by (Pali)Gemma 2. 2025-05-09 02:17:27 -07:00
configs_test.cc Move fields, io* and blob* from compression/ into io/ 2025-05-06 11:17:19 -07:00
gemma-inl.h Replace RowVectorBatch with MatStorageT 2025-05-12 09:16:12 -07:00
gemma.cc Move fields, io* and blob* from compression/ into io/ 2025-05-06 11:17:19 -07:00
gemma.h Replace RowVectorBatch with MatStorageT 2025-05-12 09:16:12 -07:00
gemma_args.h Replace RowVectorBatch with MatStorageT 2025-05-12 09:16:12 -07:00
kv_cache.cc Replace RowVectorBatch with MatStorageT 2025-05-12 09:16:12 -07:00
kv_cache.h Replace RowVectorBatch with MatStorageT 2025-05-12 09:16:12 -07:00
model_store.cc Move fields, io* and blob* from compression/ into io/ 2025-05-06 11:17:19 -07:00
model_store.h Move fields, io* and blob* from compression/ into io/ 2025-05-06 11:17:19 -07:00
run.cc Replace RowVectorBatch with MatStorageT 2025-05-12 09:16:12 -07:00
tensor_info.cc Huge refactor of weight handling and model loading. 2025-05-06 04:44:21 -07:00
tensor_info.h Huge refactor of weight handling and model loading. 2025-05-06 04:44:21 -07:00
tensor_info_test.cc Huge refactor of weight handling and model loading. 2025-05-06 04:44:21 -07:00
tokenizer.cc Huge refactor of weight handling and model loading. 2025-05-06 04:44:21 -07:00
tokenizer.h Huge refactor of weight handling and model loading. 2025-05-06 04:44:21 -07:00
weights.cc Move fields, io* and blob* from compression/ into io/ 2025-05-06 11:17:19 -07:00
weights.h Fix gcc build error and gemma3 crash, thanks @ufownl, fixes #551 2025-05-07 00:59:18 -07:00