748 Commits

Author SHA1 Message Date
Aleksander Grygier 80ab2a5d1f feat: Add cache configuration constants 2026-01-27 13:10:24 +01:00
Aleksander Grygier 8421d056be chore: update webui build output 2026-01-27 13:01:12 +01:00
Aleksander Grygier 25df25a126 refactor: Adapt message child components to MessageEditContext 2026-01-27 13:00:37 +01:00
Aleksander Grygier 93992b10a7 refactor: Encapsulate message editing state and actions in ChatMessage.svelte 2026-01-27 13:00:37 +01:00
Aleksander Grygier cbcd7956c8 refactor: Centralize chat-wide actions in ChatMessages.svelte 2026-01-27 13:00:36 +01:00
Aleksander Grygier 6b6ebd6bca feat: Introduce Chat Actions and Message Edit Contexts 2026-01-27 13:00:36 +01:00
Aleksander Grygier 357fd8d591 chore: update webui build output 2026-01-27 12:23:47 +01:00
Aleksander Grygier 6cf823fb92 refactor: Components 2026-01-27 12:20:16 +01:00
Aleksander Grygier 8a8cd78237 refactor: Improve styling and overflow handling for ChatMessageMcpPromptContent 2026-01-27 11:56:55 +01:00
Aleksander Grygier 8ca3ffa076 feat: Add support for pasting MCP prompt attachments in ChatForm 2026-01-27 11:56:55 +01:00
Aleksander Grygier 770f993086 feat: Implement clipboard serialization/deserialization for MCP prompts 2026-01-27 11:56:55 +01:00
Aleksander Grygier 99d177d442 feat: Introduce clipboard types for MCP prompt attachments 2026-01-27 11:56:55 +01:00
Aleksander Grygier 69682dcb1a fix: Edit Mode with MCP Prompt in message 2026-01-27 11:30:44 +01:00
Aleksander Grygier f22e2be4d0 refactor: Use Popover for Chat Form Prompt Picker 2026-01-27 11:22:30 +01:00
Aleksander Grygier 7eff7a31de feat: UI improvements 2026-01-27 11:07:20 +01:00
Aleksander Grygier d4a6815ea9 chore: update webui build output 2026-01-27 10:40:34 +01:00
Aleksander Grygier b834f165a4 Merge remote-tracking branch 'origin/allozaur/mcp-mvp' into allozaur/mcp-mvp 2026-01-27 10:40:11 +01:00
Aleksander Grygier e35adedb4f chore: update webui build output 2026-01-27 10:27:40 +01:00
Aleksander Grygier 1b7f576baf refactor: Components 2026-01-27 10:26:14 +01:00
Aleksander Grygier b8221e8915 refactor: Utils 2026-01-27 09:04:41 +01:00
Pascal 5e71525cac webui: remove unused sessionId, SDK handles it automatically 2026-01-26 16:41:44 +01:00
Pascal 19c32a4c96 webui: remove unused sessionId, SDK handles it automatically 2026-01-26 16:13:07 +01:00
Aleksander Grygier d444c4a7e5 chore: update webui build output 2026-01-26 15:40:02 +01:00
Aleksander Grygier 1d518cac06 fix: Wait for all MCP Servers Health Checks to load 2026-01-26 15:38:10 +01:00
Aleksander Grygier 82f26ad8e4 refactor: Cleanup 2026-01-26 15:33:27 +01:00
Aleksander Grygier 5bf1c86635 refactor: Cleanup
refactor: Cleanup
refactor: Cleanup
refactor: Cleanup
2026-01-26 15:28:50 +01:00
Aleksander Grygier 7b127db90c chore: update webui build output 2026-01-26 15:07:47 +01:00
Aleksander Grygier 717a868c23 feat: Mcp Server Selector 2026-01-26 15:03:05 +01:00
Aleksander Grygier e566d6641e fix: Scroll issues in DropdownMenuSearchable 2026-01-26 14:41:15 +01:00
Aleksander Grygier d675f403e3 chore: update webui build output 2026-01-26 14:33:58 +01:00
Aleksander Grygier ee0f0b277f feat: Improve Code blocks rendering + add auto scroll + improve global scroll bar behavior 2026-01-26 14:32:40 +01:00
Aleksander Grygier 6586ae71d2 chore: update webui build output 2026-01-26 12:34:21 +01:00
Aleksander Grygier c631e26a3f refactor: Components imports/exports structure & documentation 2026-01-26 12:30:53 +01:00
Aleksander Grygier b7d1de68c3 refactor: Cleanup 2026-01-26 09:54:44 +01:00
Aleksander Grygier 0a66568fc9 chore: update webui build output 2026-01-26 09:37:27 +01:00
Aleksander Grygier fa0cad2e6e refactor: Componentize Chat Form Prompt Picker 2026-01-26 09:36:13 +01:00
Aleksander Grygier 176abf3175 refactor: Utility function 2026-01-26 09:00:41 +01:00
Aleksander Grygier 5ee232d81c refactor: Use store methods 2026-01-26 08:52:57 +01:00
Aleksander Grygier ff0e927be2 chore: update webui build output 2026-01-25 13:38:25 +01:00
Aleksander Grygier ee9efae203 refactor: Enums 2026-01-25 13:37:08 +01:00
Aleksander Grygier 7f5284d597 refactor: Cleanup
refactor: Cleanup
refactor: Cleanup
refactor: Cleanup
2026-01-25 13:13:11 +01:00
Georgi Gerganov 080b161995 completion : fix prompt cache for recurrent models (#19045) 2026-01-25 09:12:50 +02:00
Daniel Bevenius 16639ba217 common : use two decimal places for float arg help messages (#19048)
* common : use two decimal places for float arg help messages

This commit updates the help messages for various command-line arguments
in arg.cpp to display floating-point default values with two decimal
places instead of one.

The motivation for this change is that currently only having one decimal
place means that values generated using --help or llama-gen-docs will not
display the correct values.

For example, currently the value of top-p in tools/server/README.md is
`0.9`, but the default value is actually `0.95`. And running
llama-gen-docs does not update this value as it uses the output from the
help message, which shows only one decimal place, so the values look
like they are unchanged.

* docs : run llama-gen-docs to update docs
2026-01-25 07:31:42 +01:00
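The rounding problem described in the commit above can be reproduced outside the project. The following is a minimal sketch in Python, not the code from arg.cpp: the helper `format_default` and the earlier value `0.90` are hypothetical and exist only for illustration. It shows why a default of 0.95 is displayed as `0.9` with one decimal place, and why a change from 0.90 to 0.95 would produce no visible difference in generated docs.

```python
def format_default(value: float, decimals: int) -> str:
    """Format a default value with a fixed number of decimal places."""
    return f"{value:.{decimals}f}"


old_default = 0.90  # hypothetical earlier default, for illustration only
new_default = 0.95  # the top-p default named in the commit message

# With one decimal place both values render identically.
one_decimal_old = format_default(old_default, 1)
one_decimal_new = format_default(new_default, 1)

# With two decimal places the difference becomes visible.
two_decimal_old = format_default(old_default, 2)
two_decimal_new = format_default(new_default, 2)

print(one_decimal_old, one_decimal_new)  # prints "0.9 0.9"
print(two_decimal_old, two_decimal_new)  # prints "0.90 0.95"
```

Because documentation generated from help output only sees the formatted string, the one-decimal form hides the real default; the two-decimal form does not.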
Aleksander Grygier 97642211a9 chore: update webui build output 2026-01-25 02:10:25 +01:00
Aleksander Grygier fc377123b7 refactor: Simplify MCP errors 2026-01-25 02:09:12 +01:00
Aleksander Grygier 202262c2dc chore: update webui build output 2026-01-25 01:44:14 +01:00
Aleksander Grygier b58b823b57 refactor: Types 2026-01-25 01:39:49 +01:00
Aleksander Grygier ba39f8cc7b chore: update webui build output 2026-01-25 01:21:34 +01:00
Aleksander Grygier 9bcfdc3483 refactor: DRY 2026-01-25 01:17:59 +01:00
Aleksander Grygier e7ff091881 chore: Add deprecation comment 2026-01-25 01:05:28 +01:00
Aleksander Grygier 1c843b2863 chore: update webui build output 2026-01-25 01:04:34 +01:00
Aleksander Grygier 5dfc520d67 refactor: Cleanup 2026-01-25 00:48:21 +01:00
Aleksander Grygier 6daa39994c refactor: Naming & Enums 2026-01-25 00:32:37 +01:00
Aleksander Grygier 2562dc50bd chore: update webui build output 2026-01-25 00:32:16 +01:00
Aleksander Grygier 372202632e refactor: Cleanup 2026-01-25 00:31:49 +01:00
Aleksander Grygier ba230c5cce refactor: Naming + remove redundant component 2026-01-24 23:58:17 +01:00
Aleksander Grygier f7b5f62586 refactor: Remove unused code 2026-01-24 23:45:06 +01:00
Aleksander Grygier 22d9e645aa chore: update webui build output 2026-01-24 23:39:04 +01:00
Aleksander Grygier d938994395 refactor: Cleanup 2026-01-24 23:38:37 +01:00
Johannes Gäßler e9fd8dcab4 llama-fit-params: keep explicit --ctx-size 0 (#19070) 2026-01-24 22:13:08 +01:00
Aleksander Grygier fc4c392dce chore: update webui build output 2026-01-24 20:54:24 +01:00
Aleksander Grygier 79e606eb99 refactor: Constants 2026-01-24 20:52:19 +01:00
Aleksander Grygier 3d7426cdd4 refactor: Cleanup 2026-01-24 20:47:32 +01:00
Aleksander Grygier 8bf2d38da1 chore: update webui build output 2026-01-24 20:32:53 +01:00
Aleksander Grygier 14911e51fc feat: MCP Prompts implementation improvements 2026-01-24 20:30:52 +01:00
Aleksander Grygier 801ef93522 refactor: Message Height CSS Variable 2026-01-24 19:15:38 +01:00
Aleksander Grygier 13f756421c refactor: Enums 2026-01-24 18:37:43 +01:00
Pascal 85b8da45f9 fix: resolve TypeScript error in tool response content 2026-01-24 18:04:01 +01:00
Pascal 9ddc54b668 webui: enable vision in agentic tool responses
- Include images from all message roles (not just user)
- Add multipart content support for tool responses
- Images from MCP tools now accessible in same agentic turn
2026-01-24 17:58:20 +01:00
Aleksander Grygier 172e93d494 Merge remote-tracking branch 'ggml-org/master' into allozaur/mcp-mvp 2026-01-24 15:13:58 +01:00
Aleksander Grygier da9c245838 chore: update webui build output 2026-01-24 13:59:52 +01:00
Aleksander Grygier 7c4bedda87 feat: Improve formatting performance time 2026-01-24 13:58:23 +01:00
Aleksander Grygier c39c6ef436 fix: System prompt sorting 2026-01-24 13:44:41 +01:00
Aleksander Grygier 2601bf0f59 fix: Save draft message in Chat Form when adding System Prompt from new chat view 2026-01-24 13:32:49 +01:00
Aleksander Grygier a647edfc0b fix: Chat Form submission 2026-01-24 12:33:24 +01:00
Aleksander Grygier bd16b6145c chore: update webui build output 2026-01-24 01:32:36 +01:00
Aleksander Grygier 8428741034 feat: MCP Prompts WIP 2026-01-24 01:26:17 +01:00
Aleksander Grygier 3d88d0b6b2 chore: update webui build output 2026-01-23 15:21:56 +01:00
Aleksander Grygier 9c391d8e0d feat: UI improvements 2026-01-23 15:21:03 +01:00
Aldehir Rojas a3e812811d cli : load parser definition (#19031)
* cli : load parser definition

* cont : only unload if a parser is defined
2026-01-22 20:31:22 -06:00
Xuan-Son Nguyen 51fa458a92 server : support preserving reasoning_content in assistant message (#18994)
* support reasoning_content input

* report template caps to webui

* add docs

* rm commented code
2026-01-22 21:30:06 +01:00
Xuan-Son Nguyen 4e595b250a server: do not log certain endpoints (avoid log spam) (#19028) 2026-01-22 19:24:37 +01:00
Aleksander Grygier 963711cccb chore: update webui build output 2026-01-22 18:20:55 +01:00
Aleksander Grygier 6018f85c65 feat: Architectural improvements 2026-01-22 18:19:37 +01:00
Aleksander Grygier c02e83c32a feat: Per-conversation agentic loop state 2026-01-22 17:38:51 +01:00
Xuan-Son Nguyen 9eb5bfec1a mtmd : update docs to use llama_model_n_embd_inp (#18999) 2026-01-22 14:36:32 +01:00
손희준 c6926d1d95 server: Reorder methods in `server-task.cpp` (#19016)
* Move `task_result_state::update_chat_msg` to match with header

* Move `server_task_result_cmpl_partial::to_json_anthropic()` to match with header

---------

Co-authored-by: openingnow <>
2026-01-22 14:36:04 +01:00
Hendrik Erz 3802d3c78f fix: Use `tabular-nums` for chat message statistics (#18915)
* fix: Use `tabular-nums` for chat message statistics

* fix: Rebuild WebUI
2026-01-21 18:46:01 +01:00
손희준 fbbf3ad190 server: /v1/responses (partial) (#18486)
* from previous PR

* Make instruction(system) as first message

* Convert [input_message] (text/image/file)

* Rename convert_responses_to_chatcmpl(body) -> response_body

* Initial tool call support

* Erase instructions field from chatcmpl body

* Feed reasoning texts to chat template

* Use std::vector instead of opaque json array

* Make output_item.added events consistent

* Move `server_task_result_cmpl_partial::update` from header to source

* Match ID of output_item.added and .done events

* Add function_call only if there is no "fc_" prefix

* Add function call output at non-streaming API

* Test if ID is persistent

* Add doc

* Fix style - use trailing comma

* Rewrite state management

* catch up with upstream/master

* Fix style - "type" is the first item of SSE data

* Explicitly check "instructions" from response_body

* Make lambdas static

* Check if reasoning content exists

* Add `oai_resp_id` to task_result_state(also initialized at ctor), server_task_result_cmpl_partial, and server_task_result_cmpl_final

* Reject `input_file` since it is not supported by chatcmpl

* Add "fc_" prefix to non-streaming function call id as coderabbit pointed out

---------

Co-authored-by: openingnow <>
2026-01-21 17:47:23 +01:00
Adrien Gallouët 1c7cf94b22 common, server : use the same User-Agent by default (#18957)
This commit also ensures that if a custom User-Agent is used, it will be
the only one sent.

Signed-off-by: Adrien Gallouët <angt@huggingface.co>
2026-01-20 18:28:43 +01:00
Xuan-Son Nguyen 2c1f199653 cli : fix reasoning responses in CLI (#18961)
* cli : fix reasoning responses in CLI

* fix build

* fix build (2)
2026-01-20 18:23:25 +01:00
Xuan-Son Nguyen 6df686bee6 server : refactor oai_parser_opt, move it to server_chat_params (#18937)
* server_chat_params

* move chat format into CLI

* use meta whenever possible

* clean up, no more chatml fallback
2026-01-19 23:28:01 +01:00
Lennart Austenfeld 18361c579c server: fix memory reservations in populate_token_probs (#18787) 2026-01-19 19:13:31 +01:00
Aleksander Grygier 39d0ff485d chore: update webui build output 2026-01-19 19:02:40 +01:00
Aleksander Grygier 8a95ec3ea6 feat: Improve MCP Server selection UI + lazy load health checks 2026-01-19 19:01:32 +01:00
Aleksander Grygier cafb9c09d3 feat: UI improvements 2026-01-19 16:56:02 +01:00
Aleksander Grygier 54192b05fb feat: Simplify MCP server enabling logic per chat
Refactors MCP server enabling logic to remove the dependency on global settings.

This simplifies the logic by directly checking the per-chat override status, and removes the need to pass the global enabled state as a parameter.

Additionally:
- Only shows MCP servers that are enabled in settings in the selector.
- Sorts the servers by whether they are enabled for the current chat.
2026-01-19 16:43:53 +01:00
Aleksander Grygier 62ed7f112d chore: update webui build output 2026-01-19 16:26:16 +01:00
Aleksander Grygier d37683942b fix: Missing onModelChange callback running assistant message re-generation 2026-01-19 16:25:49 +01:00
Pascal d6dfe8e064 chore: update webui build output 2026-01-19 12:12:52 +01:00