llama.cpp

History

Todor Boinovski ce38a4db47 hexagon: enable offloading to Hexagon on Windows on Snapdragon (#19150 ) * hexagon: updates to enable offloading to HTP on WoS * Update windows.md * Update windows.md * hexagon: enable -O3 optimizations * hexagon: move all _WINDOWS conditional compilation to _WIN32 * hexagon: updates to enable offloading to HTP on WoS * hexagon: use run-time vs load-time dynamic linking for cdsp driver interface * refactor htp-drv * hexagon: add run-bench.ps1 script * hexagon: htdrv refactor * hexagon: unify Android and Windows build readmes * hexagon: update README.md * hexagon: refactor htpdrv * hexagon: drv refactor * hexagon: more drv refactor * hexagon: fixes for android builds * hexagon: factor out dl into ggml-backend-dl * hexagon: add run-tool.ps1 script * hexagon: merge htp-utils in htp-drv and remove unused code * wos: no need for getopt_custom.h * wos: add missing CR in htpdrv * hexagon: ndev enforecement applies only to the Android devices * hexagon: add support for generating and signing .cat file * hexagon: add .inf file * hexagon: working auto-signing and improved windows builds * hexagon: futher improve skel build * hexagon: add rough WoS guide * hexagon: updated windows guide * hexagon: improve cmake handling of certs and logging * hexagon: improve windows setup/build doc * hexagon: more windows readme updates * hexagon: windows readme updates * hexagon: windows readme updates * hexagon: windows readme updates * hexagon: windows readme updates * Update windows.md * Update windows.md * snapdragon: rename docs/backend/hexagon to docs/backends/snapdragon Also added a power shell script to simplify build env setup. * hexagon: remove trailing whitespace and move cmake requirement to user-presets * hexagon: fix CMakeUserPresets path in workflow yaml * hexagon: introduce local version of libdl.h * hexagon: fix src1 reuse logic gpt-oss needs a bigger lookahead window. The check for src[1] itself being quantized was wrong. --------- Co-authored-by: Max Krasnyansky <maxk@qti.qualcomm.com>		2026-01-29 12:33:21 -08:00
..
apple	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
jinja	scripts : add Jinja tester PySide6 simple app (#15756 )	2025-09-05 01:05:12 +02:00
snapdragon	hexagon: enable offloading to Hexagon on Windows on Snapdragon (#19150 )	2026-01-29 12:33:21 -08:00
bench-models.sh	scripts : add script to bench models (#16894 )	2025-11-02 00:15:31 +02:00
build-info.sh	llama : reorganize source code + improve CMake (#8006 )	2024-06-26 18:33:02 +03:00
check-requirements.sh	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
compare-commits.sh	scripts: add sqlite3 check for compare-commits.sh (#15633 )	2025-08-28 19:23:22 +08:00
compare-llama-bench.py	ggml-cuda: enable cuda-graphs for `n-cpu-moe` (#18934 )	2026-01-24 14:25:20 +08:00
compare-logprobs.py	scripts: add script to compare logprobs of llama.cpp against other frameworks (#17947 )	2025-12-13 22:33:29 +01:00
create_ops_docs.py	Docs: add instructions for adding backends (#14889 )	2025-07-27 09:36:43 +08:00
debug-test.sh	refactor : remove libcurl, use OpenSSL when available (#18828 )	2026-01-14 18:02:47 +01:00
fetch_server_test_models.py	llama : move end-user examples to tools directory (#13249 )	2025-05-02 20:27:13 +02:00
gen-authors.sh	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
gen-unicode-data.py	py : type-check all Python scripts with Pyright (#8341 )	2024-07-07 15:04:39 -04:00
get-flags.mk	build : pass all warning flags to nvcc via -Xcompiler (#5570 )	2024-02-18 16:21:52 -05:00
get-hellaswag.sh	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
get-pg.sh	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
get-wikitext-2.sh	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
get-wikitext-103.sh	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
get-winogrande.sh	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
get_chat_template.py	scripts: corrected encoding when getting chat template (#11866 ) (#11907 )	2025-02-18 10:30:16 +01:00
hf.sh	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
install-oneapi.bat	support SYCL backend windows build (#5208 )	2024-01-31 08:08:07 +05:30
pr2wt.sh	scripts : follow api redirects in pr2wt.sh (#18739 )	2026-01-10 16:04:05 +01:00
serve-static.js	refactor : remove libcurl, use OpenSSL when available (#18828 )	2026-01-14 18:02:47 +01:00
server-bench.py	llama: use FA + max. GPU layers by default (#15434 )	2025-08-30 16:32:10 +02:00
sync-ggml-am.sh	scripts : update sync scripts	2025-08-18 22:06:44 +03:00
sync-ggml.last	sync : ggml	2025-12-31 18:54:43 +02:00
sync-ggml.sh	scripts : update sync scripts	2025-08-18 22:06:44 +03:00
sync_vendor.py	common : implement new jinja template engine (#18462 )	2026-01-16 11:22:06 +01:00
tool_bench.py	refactor : remove libcurl, use OpenSSL when available (#18828 )	2026-01-14 18:02:47 +01:00
tool_bench.sh	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
verify-checksum-models.py	convert.py : add python logging instead of print() (#6511 )	2024-05-03 22:36:41 +03:00
xxd.cmake	llama : move end-user examples to tools directory (#13249 )	2025-05-02 20:27:13 +02:00