| Name | Last commit | Date |
|---|---|---|
| peg-parser | common : introduce composable PEG parser combinators for chat parsing (#17136) | 2025-12-03 12:45:32 +02:00 |
| .gitignore | common : introduce composable PEG parser combinators for chat parsing (#17136) | 2025-12-03 12:45:32 +02:00 |
| CMakeLists.txt | common : introduce composable PEG parser combinators for chat parsing (#17136) | 2025-12-03 12:45:32 +02:00 |
| get-model.cpp | … | |
| get-model.h | … | |
| run-json-schema-to-grammar.mjs | … | |
| test-alloc.cpp | ggml : fix graph reallocation with multiple chunks (#16396) | 2025-10-03 13:49:08 +02:00 |
| test-arg-parser.cpp | arg: fix common_params_parse not accepting negated arg (#17991) | 2025-12-13 12:53:37 +01:00 |
| test-autorelease.cpp | … | |
| test-backend-ops.cpp | vulkan: Multi-pass softmax for large number of cols (#17892) | 2025-12-13 10:04:29 +01:00 |
| test-barrier.cpp | Fix race conditions in threadpool when dealing with dynamic/frequent n_threads changes (#17748) | 2025-12-10 12:32:23 -08:00 |
| test-c.c | … | |
| test-chat-parser.cpp | common : handle unicode during partial json parsing (#16526) | 2025-10-12 16:18:47 +03:00 |
| test-chat-peg-parser.cpp | common : introduce composable PEG parser combinators for chat parsing (#17136) | 2025-12-03 12:45:32 +02:00 |
| test-chat-template.cpp | chat : Granite Docling stopping (#16438) | 2025-10-06 18:59:40 +02:00 |
| test-chat.cpp | common : add parser for ministral/mistral large 3/devstral 2 (#17713) | 2025-12-09 17:31:04 -06:00 |
| test-double-float.cpp | … | |
| test-gbnf-validator.cpp | … | |
| test-gguf.cpp | … | |
| test-grammar-integration.cpp | llama : add token matching support to llama-grammar (#17816) | 2025-12-09 00:32:57 -06:00 |
| test-grammar-llguidance.cpp | … | |
| test-grammar-parser.cpp | llama : add token matching support to llama-grammar (#17816) | 2025-12-09 00:32:57 -06:00 |
| test-json-partial.cpp | common : handle unicode during partial json parsing (#16526) | 2025-10-12 16:18:47 +03:00 |
| test-json-schema-to-grammar.cpp | Server: Change Invalid Schema from Server Error (500) to User Error (400) (#17572) | 2025-12-02 17:33:50 +01:00 |
| test-llama-grammar.cpp | llama : add token matching support to llama-grammar (#17816) | 2025-12-09 00:32:57 -06:00 |
| test-log.cpp | … | |
| test-lora-conversion-inference.sh | cli: new CLI experience (#17824) | 2025-12-10 15:28:59 +01:00 |
| test-model-load-cancel.cpp | … | |
| test-mtmd-c-api.c | … | |
| test-opt.cpp | … | |
| test-peg-parser.cpp | common : introduce composable PEG parser combinators for chat parsing (#17136) | 2025-12-03 12:45:32 +02:00 |
| test-quantize-fns.cpp | … | |
| test-quantize-perf.cpp | ci: run the x64 and arm ci on the github machines instead (#16183) | 2025-09-25 08:06:06 +03:00 |
| test-quantize-stats.cpp | server: introduce API for serving / loading / unloading multiple models (#17470) | 2025-12-01 19:41:04 +01:00 |
| test-regex-partial.cpp | … | |
| test-rope.cpp | ggml-cpu: templateify ggml_compute_forward_rope_f32 and _f16 (#16805) | 2025-11-11 13:33:24 +02:00 |
| test-sampling.cpp | … | |
| test-thread-safety.cpp | server : support unified cache across slots (#16736) | 2025-11-02 18:14:04 +02:00 |
| test-tokenizer-0.cpp | … | |
| test-tokenizer-0.py | … | |
| test-tokenizer-0.sh | … | |
| test-tokenizer-1-bpe.cpp | … | |
| test-tokenizer-1-spm.cpp | … | |
| test-tokenizer-random.py | … | |
| test-tokenizers-repo.sh | devops: add s390x & ppc64le CI (#15925) | 2025-09-27 02:03:33 +08:00 |