llama.cpp/models
Olivier Chafik bfcce4d693
`tool-call`: support Command R7B (+ return tool_plan "thoughts" in API) (#11585)
* `tool-call`: support Command R7B (w/ tool_plan return)

* `tool-call`: cleaner preservation of tokens + warn when likely bad chat template override

* `tool-call`: test cleanup / handle lazy grammar triggers
2025-02-02 09:25:38 +00:00
templates `tool-call`: support Command R7B (+ return tool_plan "thoughts" in API) (#11585) 2025-02-02 09:25:38 +00:00
.editorconfig
ggml-vocab-aquila.gguf
ggml-vocab-baichuan.gguf
ggml-vocab-bert-bge.gguf
ggml-vocab-bert-bge.gguf.inp Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-bert-bge.gguf.out Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-chameleon.gguf.inp llama : add support for Chameleon (#8543) 2024-09-28 15:08:43 +03:00
ggml-vocab-chameleon.gguf.out llama : add support for Chameleon (#8543) 2024-09-28 15:08:43 +03:00
ggml-vocab-command-r.gguf
ggml-vocab-command-r.gguf.inp Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-command-r.gguf.out Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-deepseek-coder.gguf
ggml-vocab-deepseek-coder.gguf.inp Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-deepseek-coder.gguf.out Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-deepseek-llm.gguf
ggml-vocab-deepseek-llm.gguf.inp Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-deepseek-llm.gguf.out Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-deepseek-r1-qwen.gguf.inp llama : add support for Deepseek-R1-Qwen distill model (#11310) 2025-01-20 14:35:07 +01:00
ggml-vocab-deepseek-r1-qwen.gguf.out llama : add support for Deepseek-R1-Qwen distill model (#11310) 2025-01-20 14:35:07 +01:00
ggml-vocab-falcon.gguf
ggml-vocab-falcon.gguf.inp Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-falcon.gguf.out Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-gpt-2.gguf
ggml-vocab-gpt-2.gguf.inp Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-gpt-2.gguf.out Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-gpt-neox.gguf
ggml-vocab-llama-bpe.gguf
ggml-vocab-llama-bpe.gguf.inp Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-llama-bpe.gguf.out Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-llama-spm.gguf
ggml-vocab-llama-spm.gguf.inp Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-llama-spm.gguf.out Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-mpt.gguf
ggml-vocab-mpt.gguf.inp Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-mpt.gguf.out Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-phi-3.gguf
ggml-vocab-phi-3.gguf.inp Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-phi-3.gguf.out Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-qwen2.gguf
ggml-vocab-qwen2.gguf.inp Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-qwen2.gguf.out Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-refact.gguf
ggml-vocab-refact.gguf.inp Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-refact.gguf.out Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-roberta-bpe.gguf.inp convert : add support for Roberta embeddings (#10695) 2024-12-07 09:02:14 +02:00
ggml-vocab-roberta-bpe.gguf.out convert : add support for Roberta embeddings (#10695) 2024-12-07 09:02:14 +02:00
ggml-vocab-starcoder.gguf
ggml-vocab-starcoder.gguf.inp Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00
ggml-vocab-starcoder.gguf.out Inference support for T5 and FLAN-T5 model families (#5763) 2024-07-04 15:46:11 +02:00