| .. |
|
CMakeLists.txt
|
common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932)
|
2025-11-18 18:54:15 +01:00 |
|
arg.cpp
|
batched-bench : add "separate text gen" mode (#17103)
|
2025-11-10 12:59:29 +02:00 |
|
arg.h
|
common: move download functions to download.(cpp|h) (#17059)
|
2025-11-07 11:23:34 +01:00 |
|
base64.hpp
|
llava : expose as a shared library for downstream projects (#3613)
|
2023-11-07 00:36:23 +03:00 |
|
build-info.cpp.in
|
cmake: Add ability to pass in LLAMA_BUILD_NUMBER/COMMIT (#14167)
|
2025-06-13 10:38:52 +02:00 |
|
chat-parser-xml-toolcall.cpp
|
common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932)
|
2025-11-18 18:54:15 +01:00 |
|
chat-parser-xml-toolcall.h
|
common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932)
|
2025-11-18 18:54:15 +01:00 |
|
chat-parser.cpp
|
common : handle unicode during partial json parsing (#16526)
|
2025-10-12 16:18:47 +03:00 |
|
chat-parser.h
|
common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932)
|
2025-11-18 18:54:15 +01:00 |
|
chat.cpp
|
chat: fix int overflow, prevent size calculation in float/double (#17357)
|
2025-11-18 19:11:53 +01:00 |
|
chat.h
|
common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932)
|
2025-11-18 18:54:15 +01:00 |
|
common.cpp
|
common : more accurate sampling timing (#17382)
|
2025-11-20 13:40:10 +02:00 |
|
common.h
|
common : more accurate sampling timing (#17382)
|
2025-11-20 13:40:10 +02:00 |
|
console.cpp
|
console : utf-8 fix for windows stdin (#9690)
|
2024-09-30 11:23:42 +03:00 |
|
console.h
|
gguf : new file format with flexible meta data (beta) (#2398)
|
2023-08-21 23:07:43 +03:00 |
|
download.cpp
|
cmake : move OpenSSL linking to vendor/cpp-httplib (#17177)
|
2025-11-12 12:32:50 +01:00 |
|
download.h
|
arg: add --cache-list argument to list cached models (#17073)
|
2025-11-08 21:54:14 +01:00 |
|
http.h
|
common: introduce http.h for httplib-based client (#16373)
|
2025-10-01 20:22:18 +03:00 |
|
json-partial.cpp
|
common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932)
|
2025-11-18 18:54:15 +01:00 |
|
json-partial.h
|
sync : vendor (#13901)
|
2025-05-30 16:25:45 +03:00 |
|
json-schema-to-grammar.cpp
|
common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932)
|
2025-11-18 18:54:15 +01:00 |
|
json-schema-to-grammar.h
|
common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932)
|
2025-11-18 18:54:15 +01:00 |
|
llguidance.cpp
|
llguidance : set tokenizer slices to default (#13424)
|
2025-05-10 17:19:52 +02:00 |
|
log.cpp
|
mtmd: add mtmd_log_set (#17268)
|
2025-11-14 15:56:19 +01:00 |
|
log.h
|
mtmd: add mtmd_log_set (#17268)
|
2025-11-14 15:56:19 +01:00 |
|
ngram-cache.cpp
|
ggml : portability fixes for VS 2017 (#12150)
|
2025-03-04 18:53:26 +02:00 |
|
ngram-cache.h
|
llama : use LLAMA_TOKEN_NULL (#11062)
|
2025-01-06 10:52:15 +02:00 |
|
regex-partial.cpp
|
`common`: add partial regex support (#12808)
|
2025-05-14 19:50:57 +01:00 |
|
regex-partial.h
|
`common`: add partial regex support (#12808)
|
2025-05-14 19:50:57 +01:00 |
|
sampling.cpp
|
common : more accurate sampling timing (#17382)
|
2025-11-20 13:40:10 +02:00 |
|
sampling.h
|
sampling : optimize samplers by reusing bucket sort (#15665)
|
2025-08-31 20:41:02 +03:00 |
|
speculative.cpp
|
sampling : optimize samplers by reusing bucket sort (#15665)
|
2025-08-31 20:41:02 +03:00 |
|
speculative.h
|
server : implement universal assisted decoding (#12635)
|
2025-07-31 14:25:23 +02:00 |