llama.cpp/examples
Georgi Gerganov 1442677f92
common : refactor cli arg parsing (#7675)
* common : gpt_params_parse do not print usage

* common : rework usage print (wip)

* common : valign

* common : rework print_usage

* infill : remove cfg support

* common : reorder args

* server : deduplicate parameters

ggml-ci

* common : add missing header

ggml-ci

* common : remote --random-prompt usages

ggml-ci

* examples : migrate to gpt_params

ggml-ci

* batched-bench : migrate to gpt_params

* retrieval : migrate to gpt_params

* common : change defaults for escape and n_ctx

* common : remove chatml and instruct params

ggml-ci

* common : passkey use gpt_params
2024-06-04 21:23:39 +03:00
..
baby-llama
batched common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
batched-bench common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
batched.swift llama : add option to render special/control tokens (#6807) 2024-04-21 18:36:45 +03:00
benchmark ggml : remove old quantization functions (#5942) 2024-03-09 15:53:59 +02:00
convert-llama2c-to-ggml train : change default FA argument (#7528) 2024-05-25 15:22:35 +03:00
embedding common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
eval-callback common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
export-lora
finetune ggml : remove ggml_flash_attn and ggml_flash_ff (#7463) 2024-05-23 10:00:44 +03:00
gbnf-validator grammars: 1.5x faster inference w/ complex grammars (vector reserves / reuses) (#6609) 2024-04-11 19:47:34 +01:00
gguf gguf : add option to not check tensor data (#6582) 2024-04-10 21:16:48 +03:00
gguf-split common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
gritlm common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
imatrix common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
infill common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
jeopardy
llama-bench common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
llama.android android : module (#7502) 2024-05-25 11:11:33 +03:00
llama.swiftui llama : add option to render special/control tokens (#6807) 2024-04-21 18:36:45 +03:00
llava common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
lookahead common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
lookup common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
main common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
main-cmake-pkg ggml : remove OpenCL (#7735) 2024-06-04 21:23:20 +03:00
parallel common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
passkey common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
perplexity common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
quantize common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
quantize-stats Improve usability of --model-url & related flags (#6930) 2024-04-30 00:52:50 +01:00
retrieval common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
rpc [SYCL] Update rpc-server.cpp to include SYCL backend (#7682) 2024-06-02 12:13:54 +03:00
save-load-state common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
server common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
simple common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
speculative common : refactor cli arg parsing (#7675) 2024-06-04 21:23:39 +03:00
sycl add build shared lib in win release package (#7438) 2024-05-24 10:06:56 +08:00
tokenize Make tokenize CLI tool have nicer command line arguments. (#6188) 2024-05-25 11:14:42 +10:00
train-text-from-scratch ggml : remove ggml_flash_attn and ggml_flash_ff (#7463) 2024-05-23 10:00:44 +03:00
CMakeLists.txt llama : remove beam search (#7736) 2024-06-04 21:23:05 +03:00
Miku.sh
alpaca.sh
base-translate.sh
chat-13B.bat
chat-13B.sh
chat-persistent.sh
chat-vicuna.sh
chat.sh
convert-legacy-llama.py Move convert.py to examples/convert-legacy-llama.py (#7430) 2024-05-30 21:40:00 +10:00
gpt4all.sh
json-schema-pydantic-example.py json-schema-to-grammar improvements (+ added to server) (#5978) 2024-03-21 11:50:43 +00:00
json_schema_to_grammar.py JSON schema conversion: ️ faster repetitions, min/maxLength for strings, cap number length (#6555) 2024-04-12 19:43:38 +01:00
llama.vim
llama2-13b.sh
llama2.sh
llm.vim
pydantic-models-to-grammar-examples.py
pydantic_models_to_grammar.py
reason-act.sh
regex-to-grammar.py JSON schema conversion: ️ faster repetitions, min/maxLength for strings, cap number length (#6555) 2024-04-12 19:43:38 +01:00
server-embd.py server : refactor (#5882) 2024-03-07 11:41:53 +02:00
server-llama2-13B.sh
ts-type-to-grammar.sh JSON schema conversion: ️ faster repetitions, min/maxLength for strings, cap number length (#6555) 2024-04-12 19:43:38 +01:00