gemma.cpp

Yao Chen 029f2d3e98 Implement the matmul op with Onednn to leverage AMX optimization. PiperOrigin-RevId: 683370269		2024-10-08 12:18:35 -07:00
evals	Add MMLU eval to github	2024-05-20 10:20:53 -07:00
instantiations	Add support for PaliGemma Vision-LM (224x224) to gemma.cpp	2024-09-23 10:09:38 -07:00
activations.h	Add support for PaliGemma Vision-LM (224x224) to gemma.cpp	2024-09-23 10:09:38 -07:00
common.cc	Add support for PaliGemma Vision-LM (224x224) to gemma.cpp	2024-09-23 10:09:38 -07:00
common.h	Add support for PaliGemma Vision-LM (224x224) to gemma.cpp	2024-09-23 10:09:38 -07:00
configs.h	Internal change.	2024-10-02 14:03:34 -07:00
gemma-inl.h	Fix MSAN issue for multiturn. Rewind the prior EOS token.	2024-10-07 08:07:54 -07:00
gemma.cc	Add support for PaliGemma Vision-LM (224x224) to gemma.cpp	2024-09-23 10:09:38 -07:00
gemma.h	Implement the matmul op with Onednn to leverage AMX optimization.	2024-10-08 12:18:35 -07:00
kv_cache.cc	Major revamp #2 of Prefill: fix token order, parallel for multi-query	2024-07-25 03:28:55 -07:00
kv_cache.h	Major revamp #2 of Prefill: fix token order, parallel for multi-query	2024-07-25 03:28:55 -07:00
run.cc	Add NestedPools: one per socket/cluster	2024-10-07 09:40:19 -07:00
tokenizer.cc	Add support for PaliGemma Vision-LM (224x224) to gemma.cpp	2024-09-23 10:09:38 -07:00
tokenizer.h	7x compile time speedup: shard gemma.cc	2024-07-03 06:35:04 -07:00
weights.cc	-467ms startup: parallel Reshape	2024-10-01 04:11:35 -07:00
weights.h	Moved Internal code around to simplify	2024-10-03 07:55:21 -07:00