llama.cpp/examples
Daniel Bevenius 0a271d82b4
model-conversion : add verbose flag in run-org-model.py (#18194)
This commit adds a --verbose flag to the run-org-model.py script to
enable or disable detailed debug output, such as input and output
tensors for each layer. Debug utilities (summarize, debug_hook,
setup_rope_debug) have been moved to utils/common.py.

The motivation for this is that the detailed debug output can be useful
for diagnosing issues with model conversion or execution, but it can
also produce a large amount of output that may not always be needed.

The script will also be further cleaned/refactored in follow-up commits.
2025-12-19 08:43:16 +01:00
..
batched common : refactor common_sampler + grammar logic changes (#17937) 2025-12-14 10:11:13 +02:00
batched.swift
convert-llama2c-to-ggml
deprecation-warning
diffusion models : Added support for RND1 Diffusion Language Model (#17433) 2025-11-24 14:16:56 +08:00
embedding common : refactor common_sampler + grammar logic changes (#17937) 2025-12-14 10:11:13 +02:00
eval-callback common : refactor common_sampler + grammar logic changes (#17937) 2025-12-14 10:11:13 +02:00
gen-docs arg: clarify auto kvu/np being set on server (#17997) 2025-12-16 12:01:27 +01:00
gguf examples(gguf): GGUF example outputs (#17025) 2025-11-05 19:58:16 +02:00
gguf-hash
idle metal : add residency sets keep-alive heartbeat (#17766) 2025-12-05 19:38:54 +02:00
llama.android android: fix missing screenshots for Android.md (#18156) 2025-12-19 09:32:04 +02:00
llama.swiftui
lookahead common : refactor common_sampler + grammar logic changes (#17937) 2025-12-14 10:11:13 +02:00
lookup common : refactor common_sampler + grammar logic changes (#17937) 2025-12-14 10:11:13 +02:00
model-conversion model-conversion : add verbose flag in run-org-model.py (#18194) 2025-12-19 08:43:16 +01:00
parallel common : refactor common_sampler + grammar logic changes (#17937) 2025-12-14 10:11:13 +02:00
passkey
retrieval common : refactor common_sampler + grammar logic changes (#17937) 2025-12-14 10:11:13 +02:00
save-load-state common : refactor common_sampler + grammar logic changes (#17937) 2025-12-14 10:11:13 +02:00
simple examples : support encoder-decoder models in the simple example (#16002) 2025-09-17 10:29:00 +03:00
simple-chat
simple-cmake-pkg examples : add missing code block end marker [no ci] (#17756) 2025-12-04 14:17:30 +01:00
speculative common : restore grammar-based rejection sampling (#18137) 2025-12-17 19:46:00 +02:00
speculative-simple common : refactor common_sampler + grammar logic changes (#17937) 2025-12-14 10:11:13 +02:00
sycl sycl : support to malloc memory on device more than 4GB, update the doc and script (#17566) 2025-11-29 14:59:44 +02:00
training common : refactor common_sampler + grammar logic changes (#17937) 2025-12-14 10:11:13 +02:00
CMakeLists.txt metal : add residency sets keep-alive heartbeat (#17766) 2025-12-05 19:38:54 +02:00
convert_legacy_llama.py
json_schema_pydantic_example.py
json_schema_to_grammar.py common : fix json schema with '\' in literals (#17307) 2025-11-29 17:06:32 +01:00
llama.vim
pydantic_models_to_grammar.py
pydantic_models_to_grammar_examples.py
reason-act.sh
regex_to_grammar.py
server-llama2-13B.sh
server_embd.py
ts-type-to-grammar.sh