gemma.cpp/python
Jan Wassenberg e5c81f64a1 Major refactor: clarify query_idx (global) vs qi. Refs #607
Fix missing pos increment for last prefill and check that in gemma_test.
Thanks to @ufownl for pointing this out.

Change argument lists to QBatch with accessors.
Increase default seq_len to 8k.

PiperOrigin-RevId: 771937385
2025-06-16 02:42:02 -07:00
..
BUILD.bazel Minor: rename compression/shared -> types.h 2025-05-13 06:53:21 -07:00
configs.cc MatPtr-ify KV, shared div_seq_len, --seq_len flag 2025-06-11 09:49:38 -07:00
convert_from_safetensors.py Minor: mark command line flags as required 2025-05-12 08:30:44 -07:00
gemma_py.cc Major refactor: clarify query_idx (global) vs qi. Refs #607 2025-06-16 02:42:02 -07:00
requirements.txt Add python wrappers for configs and inference. 2025-01-28 08:22:03 -08:00
run_example.py Fix a problem in run_example.py 2025-06-04 00:42:57 -07:00