gemma.cpp/compression/python
Jan Wassenberg e76e29ce11 De-singleton ThreadingContext so callers can pass in their own
weights.cc: fix BindB argument for bf16 tensors
threading_test: enable autotune
PiperOrigin-RevId: 785763618
2025-07-22 02:08:46 -07:00
..
pytree Add Python code for converting Griffin Orbax weights. Refs #301 2024-07-29 12:53:30 -07:00
BUILD.bazel Minor: rename compression/shared -> types.h 2025-05-13 06:53:21 -07:00
compression_clif_aux.cc De-singleton ThreadingContext so callers can pass in their own 2025-07-22 02:08:46 -07:00
compression_clif_aux.h 3.8x speedup of weights loading via preadv on Linux 2025-05-15 11:55:15 -07:00
compression_extension.cc Minor: rename compression/shared -> types.h 2025-05-13 06:53:21 -07:00
compression_test.py 1.16x decode speedup: remove last MatVec in Attention 2025-06-02 09:40:29 -07:00
requirements.txt Add python wrappers for configs and inference. 2025-01-28 08:22:03 -08:00