llama.cpp/scripts
Piotr Wilkin 6c0bec52f4 The great quant laboratory squash 2026-04-13 15:03:20 +02:00
apple
hip ggml-cuda: Add generic NVFP4 MMQ kernel (#21074) 2026-04-01 12:04:58 +02:00
jinja ci : switch from pyright to ty (#20826) 2026-03-21 08:54:34 +01:00
snapdragon hexagon: improved Op queuing, buffer and cache management (#21705) 2026-04-10 15:47:43 -07:00
analyze-ffn-down.py The great quant laboratory squash 2026-04-13 15:03:20 +02:00
bench-models.sh benches : update models + numbers (#19359) 2026-02-05 14:34:07 +02:00
build-info.sh
check-requirements.sh
compare-commits.sh
compare-llama-bench.py llama-bench: add `-fitc` and `-fitt` to arguments (#21304) 2026-04-06 22:26:02 +08:00
compare-logprobs.py scripts: update corpus of compare-logprobs (#19326) 2026-02-25 12:57:34 +01:00
compute-imatrix.py The great quant laboratory squash 2026-04-13 15:03:20 +02:00
create_ops_docs.py
debug-test.sh refactor : remove libcurl, use OpenSSL when available (#18828) 2026-01-14 18:02:47 +01:00
extract-activations.py The great quant laboratory squash 2026-04-13 15:03:20 +02:00
extract-tensor-data.py The great quant laboratory squash 2026-04-13 15:03:20 +02:00
fetch_server_test_models.py server: Add cached_tokens info to oaicompat responses (#19361) 2026-03-19 19:09:33 +01:00
gen-authors.sh
gen-unicode-data.py ci : bump ty to 0.0.26 (#21156) 2026-03-30 09:29:15 +02:00
get-flags.mk
get-hellaswag.sh scripts : update get-hellaswag.sh and get-winogrande.sh (#20542) 2026-03-14 11:21:50 +01:00
get-pg.sh
get-wikitext-2.sh scripts : improve get-wikitext-2.sh (#19952) 2026-03-02 15:40:49 +01:00
get-winogrande.sh scripts : update get-hellaswag.sh and get-winogrande.sh (#20542) 2026-03-14 11:21:50 +01:00
get_chat_template.py
git-bisect-run.sh llama: end-to-end tests (#19802) 2026-03-08 12:30:21 +01:00
git-bisect.sh llama: end-to-end tests (#19802) 2026-03-08 12:30:21 +01:00
hf.sh
install-oneapi.bat
pr2wt.sh chore : correct typos [no ci] (#20041) 2026-03-05 08:50:21 +01:00
serve-static.js refactor : remove libcurl, use OpenSSL when available (#18828) 2026-01-14 18:02:47 +01:00
server-bench.py ci : switch from pyright to ty (#20826) 2026-03-21 08:54:34 +01:00
server-test-function-call.py scripts: add function call test script (#21234) 2026-04-01 15:31:58 +02:00
server-test-model.py Autoparser - complete refactoring of parser architecture (#18675) 2026-03-06 21:01:00 +01:00
sync-ggml-am.sh
sync-ggml.last sync : ggml 2026-04-02 10:39:00 +03:00
sync-ggml.sh
sync_vendor.py vendor : update cpp-httplib to 0.40.0 (#21100) 2026-03-28 08:59:44 +01:00
tool_bench.py refactor : remove libcurl, use OpenSSL when available (#18828) 2026-01-14 18:02:47 +01:00
tool_bench.sh
verify-checksum-models.py
xxd.cmake