Commit Graph

3 Commits

Author SHA1 Message Date
Heiner 08427630c3 Use only one list of weight names, with values from the gguf module.
This saves weights in the order in which they are in the Grok-1 files.
Since we operate weight-by-weight now, we no longer need caches and
name2key translations.

Per reviewer request, I also moved to using keys in gguf.TENSOR_NAMES.
2024-05-25 13:27:43 +02:00
Heiner 3c57743874 Don't split MoE weights.
As per https://github.com/ggerganov/llama.cpp/pull/7058#issuecomment-2092967508.
This helps avoid a memcopy when running.
2024-05-25 13:27:43 +02:00
Heiner 6ddf93b286 Script to convert Grok-1 weights from raw JAX pickle files. 2024-05-25 13:27:43 +02:00