Aleksander Grygier
c74065de75
chore: update webui build output
2026-02-13 13:17:51 +01:00
Aleksander Grygier
e6ad864984
feat: UI improvements
2026-02-13 13:17:51 +01:00
Pascal
cff237cb3e
webui: raw tool result display, strip only leading/trailing newlines to preserve indentation
2026-02-13 13:17:33 +01:00
Pascal
afb79b2970
webui: split raw output into backend parsing and frontend display options
2026-02-13 13:17:33 +01:00
Pascal
18efdabb12
webui: remove legacy wrapper and restore WebSocket transport
2026-02-13 13:17:33 +01:00
Pascal
a13782a4d1
webui: remove unused imports
2026-02-13 13:17:33 +01:00
Aleksander Grygier
d548bf27dd
chore: update webui build output
2026-02-13 13:17:33 +01:00
Aleksander Grygier
bdd5958f6d
feat: Improve agentic tool call streaming display with 'in progress' state
2026-02-13 13:17:32 +01:00
Aleksander Grygier
a9c2ea7a8e
feat: Enhance MCP server dropdown with search, popularity sorting, and per-chat overrides
2026-02-13 13:17:32 +01:00
Aleksander Grygier
dfce09b34b
feat: Add per-chat MCP server overrides
2026-02-13 13:17:32 +01:00
Aleksander Grygier
54374edecd
chore: update webui build output
2026-02-13 13:17:32 +01:00
Aleksander Grygier
b763a4cc69
feat: Add image load error fallback in MarkdownContent
2026-02-13 13:17:32 +01:00
Aleksander Grygier
af9a76b6dc
feat: Implement lazy MCP client shutdown
2026-02-13 13:17:32 +01:00
Aleksander Grygier
c7870a3903
feat: Enhance tool call streaming UI and output format
2026-02-13 13:17:32 +01:00
Aleksander Grygier
fb5e464fe7
feat: Display and manage servers in ChatForm actions
2026-02-13 13:17:32 +01:00
Aleksander Grygier
dc7a3f33ba
feat: Integrate server management dialog into chat settings
2026-02-13 13:03:15 +01:00
Aleksander Grygier
0b13c95519
feat: Implement dedicated server management UI components
2026-02-13 13:03:15 +01:00
Aleksander Grygier
8df7e4a54f
refactor: Centralize health check logic in store
2026-02-13 13:03:15 +01:00
Aleksander Grygier
9a8cae462e
feat: Enhance server config with headers and schema normalization
2026-02-13 13:03:15 +01:00
Aleksander Grygier
bc2d879dea
feat: Add McpLogo Svelte component
2026-02-13 13:03:15 +01:00
Aleksander Grygier
42d52605d9
refactor: Consolidate UI CSS classes into shared module
2026-02-13 13:03:15 +01:00
Aleksander Grygier
6c95020b06
chore: update webui build output
2026-02-13 12:57:23 +01:00
Aleksander Grygier
62dbc9f654
feat: Raw LLM output switch per message
2026-02-13 12:57:23 +01:00
Aleksander Grygier
284425097b
refactor: Tool call handling
2026-02-13 12:57:03 +01:00
Aleksander Grygier
5beeb88a37
docs: Update high-level architecture diagrams for MCP integration
2026-02-13 12:55:42 +01:00
Aleksander Grygier
acdd30e3af
feat: Add AgenticContent component for enhanced tool call rendering
2026-02-13 12:55:42 +01:00
Aleksander Grygier
49a8c8b148
refactor: Update ChatStore to leverage mcpStore for agentic flow
2026-02-13 12:55:42 +01:00
Aleksander Grygier
5b582beb75
feat: Implement agentic orchestration within ChatService
2026-02-13 12:55:03 +01:00
Aleksander Grygier
391479edb2
feat: Introduce reactive mcpStore for client lifecycle management
2026-02-13 12:55:03 +01:00
Aleksander Grygier
7e184c174d
feat: Refactor MCP client to use official SDK
2026-02-13 12:55:03 +01:00
Aleksander Grygier
1a041a5b9b
feat: Add @modelcontextprotocol/sdk and zod dependencies
2026-02-13 12:55:03 +01:00
Aleksander Grygier
2325d2a50d
refactor: Update Agentic and MCP config parsing to use new utils and constants
2026-02-13 12:55:03 +01:00
Aleksander Grygier
0c24db3178
feat: Centralize MCP and Agentic type definitions and constants
2026-02-13 12:55:02 +01:00
Aleksander Grygier
26a19183b7
feat: Introduce common utility functions
2026-02-13 12:55:02 +01:00
Pascal
14f6728ef1
webui: use normalizedMessages after upstream refactor
2026-02-13 12:55:02 +01:00
Pascal
cb99ed9f71
webui: MCP client with low coupling to current codebase
2026-02-13 12:55:02 +01:00
Sigbjørn Skjæret
b2ecc0cdb4
support --verbose-prompt ( #19576 )
2026-02-13 12:49:10 +01:00
Aman Gupta
5065da554e
CUDA: loop over ne2*ne3 in case it overflows ( #19538 )
...
* CUDA: loop over ne2*ne3 in case it overflows
* use fastdiv
2026-02-13 17:01:40 +05:30
Aleksander Grygier
5174d7206f
webui: UI and routing fixes ( #19586 )
...
* chore: update webui build output
* chore: update webui build output
* fix: Scroll issues in DropdownMenuSearchable
* webui: fix redirect to root ignoring base path
* fix: Word wrapping
* fix: remove obsolete modality UI tests causing CI failures
- Remove VisionModality/AudioModality test stories
- Remove mockServerProps usage and imports
- Simplify Default test (remove dropdown interaction checks)
- Simplify FileAttachments test (remove mocks)
* feat: Improve formatting performance time
---------
Co-authored-by: Pascal <admin@serveurperso.com>
2026-02-13 12:31:00 +01:00
Oliver Simons
43919b7f4f
CUDA: Do not mutate cgraph for fused ADDs ( #19566 )
...
* Do not mutate cgraph for fused ADDs
1. We should try to minimize in-place changes to the incoming
ggml_cgraph where possible (those should happen in graph_optimize)
2. Modifying in-place leads to an additional, unnecessary graph capture
step as we store the properties before modifying the graph in-place
in the cuda-backend
* Assert ggml_tensor is trivially copyable
* Update ggml/src/ggml-cuda/ggml-cuda.cu
Co-authored-by: Aman Gupta <amangupta052@gmail.com>
---------
Co-authored-by: Aman Gupta <amangupta052@gmail.com>
2026-02-13 15:07:55 +05:30
Pavan Shinde
423cf0b26f
docs : fix broken link and typo ( #19560 )
2026-02-13 09:38:09 +01:00
ymcki
33a56f90a6
model : Kimi Linear fix conv state update ( #19531 )
...
* fix conv state update for llama-server parallel serving
---------
Co-authored-by: Piotr Wilkin (ilintar) <piotr.wilkin@syndatis.com>
2026-02-13 09:10:18 +01:00
Adrien Gallouët
25224c8021
llama : remove deprecated codecvt ( #19565 )
...
Using the same conversion function ensures a consistent matching between
the regex pattern and the text.
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
2026-02-13 06:43:53 +01:00
Adrien Gallouët
2f5d8f8edc
vendor : update BoringSSL to 0.20260211.0 ( #19562 )
...
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
2026-02-13 06:43:26 +01:00
Georgi Gerganov
bb96bfd361
memory : fix kv cache size for hybrid models ( #19559 )
2026-02-13 07:36:24 +02:00
Georgi Gerganov
0644baefde
metal : improve concurrency ( #19555 )
2026-02-13 07:35:57 +02:00
Georgi Gerganov
490eb96b88
metal : support GGML_OP_SET ( #19548 )
2026-02-13 07:34:52 +02:00
Shupei Fan
3bb78133ab
hexagon: fix typo in vtcm_needs_release ( #19545 )
2026-02-12 15:07:49 -08:00
lhez
79cc0f2daf
opencl: add basic support for q4_1 ( #19534 )
...
* opencl: add q4_1 mv
* opencl: clean up
* opencl: add flattened q4_1 mv
* opencl: clean up
* opencl: add basic q4_1 mm
* opencl: fix whitespace
* opencl: add general q4_0 mm
2026-02-12 14:52:37 -08:00
Georgi Gerganov
338085c69e
args : add -kvu to llama-parallel ( #19577 )
2026-02-12 21:52:41 +02:00