Commit Graph

8062 Commits

Author SHA1 Message Date
Aleksander Grygier bdd5958f6d feat: Improve agentic tool call streaming display with 'in progress' state 2026-02-13 13:17:32 +01:00
Aleksander Grygier a9c2ea7a8e feat: Enhance MCP server dropdown with search, popularity sorting, and per-chat overrides 2026-02-13 13:17:32 +01:00
Aleksander Grygier dfce09b34b feat: Add per-chat MCP server overrides 2026-02-13 13:17:32 +01:00
Aleksander Grygier 54374edecd chore: update webui build output 2026-02-13 13:17:32 +01:00
Aleksander Grygier b763a4cc69 feat: Add image load error fallback in MarkdownContent 2026-02-13 13:17:32 +01:00
Aleksander Grygier af9a76b6dc feat: Implement lazy MCP client shutdown 2026-02-13 13:17:32 +01:00
Aleksander Grygier c7870a3903 feat: Enhance tool call streaming UI and output format 2026-02-13 13:17:32 +01:00
Aleksander Grygier fb5e464fe7 feat: Display and manage servers in ChatForm actions 2026-02-13 13:17:32 +01:00
Aleksander Grygier dc7a3f33ba feat: Integrate server management dialog into chat settings 2026-02-13 13:03:15 +01:00
Aleksander Grygier 0b13c95519 feat: Implement dedicated server management UI components 2026-02-13 13:03:15 +01:00
Aleksander Grygier 8df7e4a54f refactor: Centralize health check logic in store 2026-02-13 13:03:15 +01:00
Aleksander Grygier 9a8cae462e feat: Enhance server config with headers and schema normalization 2026-02-13 13:03:15 +01:00
Aleksander Grygier bc2d879dea feat: Add McpLogo Svelte component 2026-02-13 13:03:15 +01:00
Aleksander Grygier 42d52605d9 refactor: Consolidate UI CSS classes into shared module 2026-02-13 13:03:15 +01:00
Aleksander Grygier 6c95020b06 chore: update webui build output 2026-02-13 12:57:23 +01:00
Aleksander Grygier 62dbc9f654 feat: Raw LLM output switch per message 2026-02-13 12:57:23 +01:00
Aleksander Grygier 284425097b refactor: Tool call handling 2026-02-13 12:57:03 +01:00
Aleksander Grygier 5beeb88a37 docs: Update high-level architecture diagrams for MCP integration 2026-02-13 12:55:42 +01:00
Aleksander Grygier acdd30e3af feat: Add AgenticContent component for enhanced tool call rendering 2026-02-13 12:55:42 +01:00
Aleksander Grygier 49a8c8b148 refactor: Update ChatStore to leverage mcpStore for agentic flow 2026-02-13 12:55:42 +01:00
Aleksander Grygier 5b582beb75 feat: Implement agentic orchestration within ChatService 2026-02-13 12:55:03 +01:00
Aleksander Grygier 391479edb2 feat: Introduce reactive mcpStore for client lifecycle management 2026-02-13 12:55:03 +01:00
Aleksander Grygier 7e184c174d feat: Refactor MCP client to use official SDK 2026-02-13 12:55:03 +01:00
Aleksander Grygier 1a041a5b9b feat: Add @modelcontextprotocol/sdk and zod dependencies 2026-02-13 12:55:03 +01:00
Aleksander Grygier 2325d2a50d refactor: Update Agentic and MCP config parsing to use new utils and constants 2026-02-13 12:55:03 +01:00
Aleksander Grygier 0c24db3178 feat: Centralize MCP and Agentic type definitions and constants 2026-02-13 12:55:02 +01:00
Aleksander Grygier 26a19183b7 feat: Introduce common utility functions 2026-02-13 12:55:02 +01:00
Pascal 14f6728ef1 webui: use normalizedMessages after upstream refactor 2026-02-13 12:55:02 +01:00
Pascal cb99ed9f71 webui: MCP client with low coupling to current codebase 2026-02-13 12:55:02 +01:00
Sigbjørn Skjæret b2ecc0cdb4
support --verbose-prompt (#19576) 2026-02-13 12:49:10 +01:00
Aman Gupta 5065da554e
CUDA: loop over ne2*ne3 in case it overflows (#19538)
* CUDA: loop over ne2*ne3 in case it overflows

* use fastdiv
2026-02-13 17:01:40 +05:30
Aleksander Grygier 5174d7206f
webui: UI and routing fixes (#19586)
* chore: update webui build output

* chore: update webui build output

* fix: Scroll issues in DropdownMenuSearchable

* webui: fix redirect to root ignoring base path

* fix: Word wrapping

* fix: remove obsolete modality UI tests causing CI failures

- Remove VisionModality/AudioModality test stories
- Remove mockServerProps usage and imports
- Simplify Default test (remove dropdown interaction checks)
- Simplify FileAttachments test (remove mocks)

* feat: Improve formatting performance time

---------

Co-authored-by: Pascal <admin@serveurperso.com>
2026-02-13 12:31:00 +01:00
Oliver Simons 43919b7f4f
CUDA: Do not mutate cgraph for fused ADDs (#19566)
* Do not mutate cgraph for fused ADDs

1. We should try to minimize in-place changes to the incoming
   ggml_cgraph where possible (those should happen in graph_optimize)
2. Modifying in-place leads to an additional, unnecessary graph capture
   step as we store the properties before modifying the graph in-place
   in the cuda-backend

* Assert ggml_tensor is trivially copyable

* Update ggml/src/ggml-cuda/ggml-cuda.cu

Co-authored-by: Aman Gupta <amangupta052@gmail.com>

---------

Co-authored-by: Aman Gupta <amangupta052@gmail.com>
2026-02-13 15:07:55 +05:30
Pavan Shinde 423cf0b26f
docs : fix broken link and typo (#19560) 2026-02-13 09:38:09 +01:00
ymcki 33a56f90a6
model : Kimi Linear fix conv state update (#19531)
* fix conv state update for llama-server parallel serving

---------

Co-authored-by: Piotr Wilkin (ilintar) <piotr.wilkin@syndatis.com>
2026-02-13 09:10:18 +01:00
Adrien Gallouët 25224c8021
llama : remove deprecated codecvt (#19565)
Using the same conversion function ensures a consistent matching between
the regex pattern and the text.

Signed-off-by: Adrien Gallouët <angt@huggingface.co>
2026-02-13 06:43:53 +01:00
Adrien Gallouët 2f5d8f8edc
vendor : update BoringSSL to 0.20260211.0 (#19562)
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
2026-02-13 06:43:26 +01:00
Georgi Gerganov bb96bfd361
memory : fix kv cache size for hybrid models (#19559) 2026-02-13 07:36:24 +02:00
Georgi Gerganov 0644baefde
metal : improve concurrency (#19555) 2026-02-13 07:35:57 +02:00
Georgi Gerganov 490eb96b88
metal : support GGML_OP_SET (#19548) 2026-02-13 07:34:52 +02:00
Shupei Fan 3bb78133ab
hexagon: fix typo in vtcm_needs_release (#19545) 2026-02-12 15:07:49 -08:00
lhez 79cc0f2daf
opencl: add basic support for q4_1 (#19534)
* opencl: add q4_1 mv

* opencl: clean up

* opencl: add flattened q4_1 mv

* opencl: clean up

* opencl: add basic q4_1 mm

* opencl: fix whitespace

* opencl: add general q4_0 mm
2026-02-12 14:52:37 -08:00
Georgi Gerganov 338085c69e
args : add -kvu to llama-parallel (#19577) 2026-02-12 21:52:41 +02:00
Aleksander Grygier 4c61875bf8
webui: Add switcher to Chat Message UI to show raw LLM output (#19571) 2026-02-12 19:55:51 +01:00
Adrien Gallouët 4b385bfcf8
vendor : update cpp-httplib (#19537)
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
2026-02-12 16:11:22 +01:00
Christian Schmitz f488429380
llama : update outdated comment in llama.h (#19428)
* Updated documentation

Model is no longer a parameter

* llama : fix trailing whitespace in comment

---------

Co-authored-by: Daniel Bevenius <daniel.bevenius@gmail.com>
2026-02-12 15:52:57 +01:00
Aleksander Grygier 4d688f9ebb
(webui) FEATURE: Enable adding or injecting System Message into chat (#19556)
* feat: Enable adding System Prompt per-chat

* fix: Save draft message in Chat Form when adding System Prompt from new chat view

* fix: Proper system message deletion logic

* chore: Formatting

* chore: update webui build output
2026-02-12 13:56:08 +01:00
Daniel Bevenius ff599039a9
scripts : add support for forks in pr2wt.sh (#19540)
This commit adds support for using the pr2wt.sh (pull request to
workspace) script with forks of upstream llama.cpp.
2026-02-12 13:14:28 +01:00
Aleksander Grygier f486ce9f30
(webui) REFACTOR: UI primitives and polish (#19551)
* webui: UI primitives and polish (non-MCP)

* chore: update webui build output
2026-02-12 12:21:00 +01:00
Aleksander Grygier 38adc7d469
WebUI Architecture Cleanup (#19541)
* webui: architecture foundation (non-MCP core refactors)

* chore: update webui build output
2026-02-12 11:22:27 +01:00