happyz/llama.cpp
mirror of https://github.com/ggerganov/llama.cpp.git
ddf98bdf28
llama.cpp/tests
Xuan Son Nguyen
399b39f21b
Merge branch 'master' into xsn/server_model_management_v1_2
2025-11-24 14:45:57 +01:00
.gitignore
…
CMakeLists.txt
…
get-model.cpp
…
get-model.h
…
run-json-schema-to-grammar.mjs
…
test-alloc.cpp
ggml : fix graph reallocation with multiple chunks (#16396)
2025-10-03 13:49:08 +02:00
test-arg-parser.cpp
common : remove common_has_curl() (#16351)
2025-09-30 17:39:44 +03:00
test-autorelease.cpp
…
test-backend-ops.cpp
cuda : support non-contiguous i32 to i32 copy (#17326)
2025-11-23 11:13:34 +01:00
test-barrier.cpp
test-barrier : do not use more threads than physically available (#16389)
2025-10-02 20:10:12 +02:00
test-c.c
…
test-chat-parser.cpp
common : handle unicode during partial json parsing (#16526)
2025-10-12 16:18:47 +03:00
test-chat-template.cpp
chat : Granite Docling stopping (#16438)
2025-10-06 18:59:40 +02:00
test-chat.cpp
common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932)
2025-11-18 18:54:15 +01:00
test-double-float.cpp
…
test-gbnf-validator.cpp
…
test-gguf.cpp
…
test-grammar-integration.cpp
grammar : use int64_t to avoid int overflows in int schema to grammar conversion logic (#16626)
2025-10-17 08:59:31 +03:00
test-grammar-llguidance.cpp
…
test-grammar-parser.cpp
…
test-json-partial.cpp
common : handle unicode during partial json parsing (#16526)
2025-10-12 16:18:47 +03:00
test-json-schema-to-grammar.cpp
grammar : support array references in json schema (#16792)
2025-10-28 09:37:52 +01:00
test-llama-grammar.cpp
…
test-log.cpp
…
test-lora-conversion-inference.sh
…
test-model-load-cancel.cpp
…
test-mtmd-c-api.c
…
test-opt.cpp
…
test-quantize-fns.cpp
…
test-quantize-perf.cpp
…
test-quantize-stats.cpp
fix compile
2025-11-21 23:26:32 +01:00
test-regex-partial.cpp
…
test-rope.cpp
ggml-cpu: templateify ggml_compute_forward_rope_f32 and _f16 (#16805)
2025-11-11 13:33:24 +02:00
test-sampling.cpp
…
test-thread-safety.cpp
server : support unified cache across slots (#16736)
2025-11-02 18:14:04 +02:00
test-tokenizer-0.cpp
…
test-tokenizer-0.py
…
test-tokenizer-0.sh
…
test-tokenizer-1-bpe.cpp
…
test-tokenizer-1-spm.cpp
…
test-tokenizer-random.py
…
test-tokenizers-repo.sh
…