gemma.cpp/gemma
Paul Chang 419dc34ed5 Generic MHA/MQA/GQA implementation
PiperOrigin-RevId: 636937885
2024-05-24 09:05:53 -07:00
..
evals Add MMLU eval to github 2024-05-20 10:20:53 -07:00
benchmark.cc Add MMLU eval to github 2024-05-20 10:20:53 -07:00
compress_weights.cc Improve documentation for compress_weights flags 2024-04-29 06:49:50 -07:00
configs.h Support additional scaling 2024-05-07 08:16:25 -07:00
gemma.cc Generic MHA/MQA/GQA implementation 2024-05-24 09:05:53 -07:00
gemma.h Add MMLU eval to github 2024-05-20 10:20:53 -07:00
gemma_test.cc Pass most runtime parameters using const RuntimeConfig& 2024-05-14 07:04:53 -07:00
ops.h Fix normalization in Softmax function. 2024-05-24 08:58:31 +00:00
ops_test.cc Fix normalization in Softmax function. 2024-05-24 08:58:31 +00:00
run.cc Refer to --weights rather than --compressed_weights to simplify CLI docs 2024-05-16 07:51:49 -07:00
run_mmlu.cc Add MMLU eval to github 2024-05-20 10:20:53 -07:00