llama.cpp

History

Kwa Jie Hao 243532e556 jinja : support ensure_ascii=true, string repetition and int/float self-filtering (#21623 ) * feat: jinja engine improvements for reka-edge Port three Jinja engine improvements needed for the reka-edge model: 1. Python-style string repetition ("ab" * 3 → "ababab") 2. ensure_ascii=true support for tojson filter (escapes non-ASCII to \uXXXX) 3. int() builtin on value_int_t (identity, needed for Reka Edge template) * fix: escape invalid utf8 bytes when ensure_ascii=true The json_ensure_ascii_preserving_format function does not correctly handle an edge case where if UTF-8 parsing fails, it adds the non-ascii character back to the output as a raw byte. This commit fixes that by adding the unicode standard replacement character \\ufffd to the output instead. This is the standard behavior for various programming languages like Python, Rust, Go, etc. * chore: address PR comments 1. Add todo comment for supporting string repetition for array/tuples 2. Add support for float identity operation 3. Move invalid ascii test case to test_fuzzing * chore: accept suggestion for common/jinja/value.cpp Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com> --------- Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>		2026-04-09 11:28:33 +02:00
..
peg-parser	common : add commentary rules for gpt-oss-20b (#21286 )	2026-04-02 08:59:59 -05:00
snapshots	tests : add unit test coverage for llama_tensor_get_type (#20112 )	2026-04-02 22:53:58 +02:00
.gitignore	tests : add unit test coverage for llama_tensor_get_type (#20112 )	2026-04-02 22:53:58 +02:00
CMakeLists.txt	tests : add unit test coverage for llama_tensor_get_type (#20112 )	2026-04-02 22:53:58 +02:00
export-graph-ops.cpp	tests: allow exporting graph ops from HF file without downloading weights (#21182 )	2026-04-02 18:19:20 +02:00
get-model.cpp	ci : add model tests + script wrapper (#4586 )	2024-01-26 14:18:00 +02:00
get-model.h	ci : add model tests + script wrapper (#4586 )	2024-01-26 14:18:00 +02:00
gguf-model-data.cpp	tests : add unit test coverage for llama_tensor_get_type (#20112 )	2026-04-02 22:53:58 +02:00
gguf-model-data.h	tests : add unit test coverage for llama_tensor_get_type (#20112 )	2026-04-02 22:53:58 +02:00
test-alloc.cpp	chore : correct typos [no ci] (#20041 )	2026-03-05 08:50:21 +01:00
test-arg-parser.cpp	ci, tests : use cmake to download models and remove libcurl dependency (#18791 )	2026-01-14 07:46:27 +01:00
test-autorelease.cpp	docs : Minor cleanups (#19252 )	2026-02-02 08:38:55 +02:00
test-backend-ops.cpp	metal: Q1_0 backend (#21528 )	2026-04-08 16:07:47 +03:00
test-backend-sampler.cpp	tests: enable kv_unified to prevent cuda oom error on rtx 2060 (#20645 )	2026-03-18 17:40:22 +08:00
test-barrier.cpp	Fix race conditions in threadpool when dealing with dynamic/frequent n_threads changes (#17748 )	2025-12-10 12:32:23 -08:00
test-c.c	ggml : remove kompute backend (#14501 )	2025-07-03 07:48:32 +03:00
test-chat-auto-parser.cpp	common/parser: fix reasoning whitespace bugs + extra parser tests (#21085 )	2026-03-28 07:29:26 +01:00
test-chat-peg-parser.cpp	common/parser: add proper reasoning tag prefill reading (#20424 )	2026-03-19 16:58:21 +01:00
test-chat-template.cpp	chat : add Granite 4.0 chat template with correct tool_call role mapping (#20804 )	2026-04-02 11:28:56 +02:00
test-chat.cpp	Propose fix a couple of typos (#21581 )	2026-04-08 16:29:03 +02:00
test-double-float.cpp	ggml : minor naming changes (#8433 )	2024-07-12 10:46:02 +03:00
test-gbnf-validator.cpp	cmake : do not include ./src as public for libllama (#13062 )	2025-04-24 16:00:10 +03:00
test-gguf-model-data.cpp	tests : add unit test coverage for llama_tensor_get_type (#20112 )	2026-04-02 22:53:58 +02:00
test-gguf.cpp	llama: fix llama-model-saver (#20503 )	2026-03-25 12:53:16 +02:00
test-grammar-integration.cpp	common/grammar: fix grammar parsing issues to prevent stack overflow and hangs (#18604 )	2026-03-21 18:43:35 +01:00
test-grammar-llguidance.cpp	tool/ex/tests: consistently free ctx, then model (#18168 )	2025-12-22 11:00:37 +01:00
test-grammar-parser.cpp	common/grammar: fix grammar parsing issues to prevent stack overflow and hangs (#18604 )	2026-03-21 18:43:35 +01:00
test-jinja.cpp	jinja : support ensure_ascii=true, string repetition and int/float self-filtering (#21623 )	2026-04-09 11:28:33 +02:00
test-json-partial.cpp	common : handle unicode during partial json parsing (#16526 )	2025-10-12 16:18:47 +03:00
test-json-schema-to-grammar.cpp	tests : remove obsolete .mjs script (#21615 )	2026-04-08 13:20:46 +03:00
test-llama-archs.cpp	model, mtmd: fix gguf conversion for audio/vision mmproj (#21309 )	2026-04-02 17:10:32 +02:00
test-llama-grammar.cpp	common/grammar: fix grammar parsing issues to prevent stack overflow and hangs (#18604 )	2026-03-21 18:43:35 +01:00
test-log.cpp	common : use common_ prefix for common library functions (#9805 )	2024-10-10 22:57:42 +02:00
test-lora-conversion-inference.sh	cli: new CLI experience (#17824 )	2025-12-10 15:28:59 +01:00
test-model-load-cancel.cpp	llama : update llama_model API names (#11063 )	2025-01-06 10:55:18 +02:00
test-mtmd-c-api.c	mtmd : add C public API (#13184 )	2025-05-04 23:43:42 +02:00
test-opt.cpp	tests : fix test-opt with GGML_BACKEND_DL (#15599 )	2025-08-26 22:14:38 +02:00
test-peg-parser.cpp	Autoparser - complete refactoring of parser architecture (#18675 )	2026-03-06 21:01:00 +01:00
test-quant-type-selection.cpp	tests : add unit test coverage for llama_tensor_get_type (#20112 )	2026-04-02 22:53:58 +02:00
test-quantize-fns.cpp	ggml: add Q1_0 1-bit quantization support (CPU) (#21273 )	2026-04-06 20:55:21 +02:00
test-quantize-perf.cpp	ci: run the x64 and arm ci on the github machines instead (#16183 )	2025-09-25 08:06:06 +03:00
test-quantize-stats.cpp	server: introduce API for serving / loading / unloading multiple models (#17470 )	2025-12-01 19:41:04 +01:00
test-reasoning-budget.cpp	common : inhibit lazy grammar sampler while reasoning is active (#20970 )	2026-03-27 18:30:40 +01:00
test-regex-partial.cpp	common/grammar : replace problematic backtracking regex `[\s\S]*` (#18342 )	2026-01-03 16:02:43 -06:00
test-rope.cpp	ggml-cpu: templateify ggml_compute_forward_rope_f32 and _f16 (#16805 )	2025-11-11 13:33:24 +02:00
test-sampling.cpp	sampling : optimize samplers by reusing bucket sort (#15665 )	2025-08-31 20:41:02 +03:00
test-state-restore-fragmented.cpp	common : move up common_init() and fix Windows UTF-8 logs (#21176 )	2026-03-31 12:53:41 +02:00
test-thread-safety.cpp	common : move up common_init() and fix Windows UTF-8 logs (#21176 )	2026-03-31 12:53:41 +02:00
test-tokenizer-0.cpp	tool/ex/tests: consistently free ctx, then model (#18168 )	2025-12-22 11:00:37 +01:00
test-tokenizer-0.py	py : logging and flake8 suppression refactoring (#7081 )	2024-05-05 08:07:48 +03:00
test-tokenizer-0.sh	model : add Jina Embeddings v5 Nano (partial EuroBERT) support (#19826 )	2026-02-26 12:14:09 +01:00
test-tokenizer-1-bpe.cpp	tool/ex/tests: consistently free ctx, then model (#18168 )	2025-12-22 11:00:37 +01:00
test-tokenizer-1-spm.cpp	tool/ex/tests: consistently free ctx, then model (#18168 )	2025-12-22 11:00:37 +01:00
test-tokenizer-random.py	ci : switch from pyright to ty (#20826 )	2026-03-21 08:54:34 +01:00
test-tokenizers-repo.sh	devops: add s390x & ppc64le CI (#15925 )	2025-09-27 02:03:33 +08:00
testing.h	common : implement new jinja template engine (#18462 )	2026-01-16 11:22:06 +01:00