Commit Graph

7304 Commits

Author SHA1 Message Date
Aleksander Grygier 493ef08723 refactor: Utils imports + move types to `app.d.ts` 2025-11-29 02:33:37 +01:00
Aleksander Grygier ce9c9afe0d chore: update webui build output 2025-11-29 01:40:00 +01:00
Aleksander Grygier 2464e06028 feat: Improve UI sidebar background color 2025-11-29 01:39:40 +01:00
Aleksander Grygier 27b152267f refactor: Constants 2025-11-29 01:38:02 +01:00
Aleksander Grygier 648d2deebc feat: Attachment logic & UI improvements 2025-11-29 01:36:05 +01:00
Aleksander Grygier d49d97c642 refactor: Cleanup 2025-11-29 00:51:18 +01:00
Aleksander Grygier f50ce7b5b4 refactor: Cleanup 2025-11-29 00:50:16 +01:00
Aleksander Grygier 4d16459b4c re 2025-11-29 00:49:46 +01:00
Aleksander Grygier c76de5e0ad refactor: Cleanup 2025-11-29 00:49:20 +01:00
Aleksander Grygier 2f97dbfa65 docs: Add info comment 2025-11-29 00:49:03 +01:00
Aleksander Grygier 1adf173dd6 refactor: Cleanup 2025-11-28 19:36:03 +01:00
Aleksander Grygier dd30810d0a fix: Modality detection improvement for text-based PDF attachments 2025-11-28 19:30:32 +01:00
Aleksander Grygier 171a0926a1 chore: update webui build output 2025-11-28 16:00:44 +01:00
Aleksander Grygier 68b653ef45 refactor: DRY `getAttachmentDisplayItems` function + fix UI 2025-11-28 15:58:52 +01:00
Aleksander Grygier 1cf5daa8c0 refactor: Cleanup 2025-11-28 15:56:41 +01:00
Aleksander Grygier 04ef4a06e2 chore: update webui build output 2025-11-28 15:44:43 +01:00
Aleksander Grygier 5fadd0fe18 refactor: Components naming 2025-11-28 15:39:47 +01:00
Aleksander Grygier 3470b12b76 chore: update webui build output 2025-11-28 15:09:55 +01:00
Aleksander Grygier eed1bd9b97 refactor: Enhance model info and attachment handling 2025-11-28 15:08:41 +01:00
Aleksander Grygier 491fe2d3f7 feat: Update logic for PDF as Image 2025-11-28 13:10:00 +01:00
Aleksander Grygier bc577266b9 docs: Architecture documentation 2025-11-27 22:04:20 +01:00
Aleksander Grygier db479523ec feat: Condition available models based on modality + better model loading strategy & UX 2025-11-27 19:13:05 +01:00
Aleksander Grygier 9086bc30bd feat: Improve statistic badges 2025-11-27 14:12:21 +01:00
Aleksander Grygier d73353732f refactor: Architecture cleanup 2025-11-27 14:03:25 +01:00
Aleksander Grygier 78ead49830 Merge remote-tracking branch 'ngxson/xsn/server_model_management_v1_2' into allozaur/server_model_management_v1_2 2025-11-27 13:48:21 +01:00
Aleksander Grygier 6a3d6e79d2 refactor: Services/Stores syntax + logic improvements
Refactors components to access stores directly instead of using exported getter functions.

This change centralizes store access and logic, simplifying component code and improving maintainability by reducing the number of exported functions and promoting direct store interaction.

Removes exported getter functions from `chat.svelte.ts`, `conversations.svelte.ts`, `models.svelte.ts` and `settings.svelte.ts`.
2025-11-27 13:44:49 +01:00
Aleksander Grygier 69065ddc56 fix: UI 2025-11-27 11:27:58 +01:00
Aleksander Grygier 6b95118abc refactor: Processing state reactivity 2025-11-27 11:11:45 +01:00
Aleksander Grygier 2a5922b1f6 chore: update webui build output 2025-11-26 17:52:40 +01:00
Aleksander Grygier 13e7988459 refactor: Model modality handling 2025-11-26 17:51:25 +01:00
Xuan Son Nguyen 1493ee09ea tmp webui build 2025-11-26 17:43:27 +01:00
Aleksander Grygier d6ee3d133a refactor: Server store 2025-11-26 17:16:41 +01:00
Aleksander Grygier 456828b365 refactor: Chat requests abort handling 2025-11-26 16:48:13 +01:00
Aleksander Grygier 42483f463d refactor: Remove ConversationsService 2025-11-26 16:45:07 +01:00
Xuan Son Nguyen becc602612 Merge branch 'master' into xsn/server_model_management_v1_2 2025-11-26 16:21:57 +01:00
Xuan Son Nguyen e2731c3767 set hf_repo/docker_repo as model alias when possible 2025-11-26 15:57:20 +01:00
Xuan Son Nguyen e40f35fb61 remove support for extra args 2025-11-26 15:43:27 +01:00
Aleksander Grygier ddf98bdf28 refactor: Improve API header management via utility functions 2025-11-26 15:36:09 +01:00
Aleksander Grygier 9431f358b8 chore: update webui build output 2025-11-26 15:07:12 +01:00
Aleksander Grygier 284557cd2f feat: Improve model loading/unloading status updates 2025-11-26 15:06:11 +01:00
xctan 6ab4e50d9c ggml-cpu : add RISC-V Zvfh impl for ggml_vec_mad_f16 (#17448)
* ggml-cpu : add RISC-V Zvfh impl for ggml_vec_mad_f16

* ggml-cpu : dedup scalar impl

* Update ggml/src/ggml-cpu/vec.h

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2025-11-26 15:33:05 +02:00
Adrien Gallouët 2336cc4784 cmake : use EXCLUDE_FROM_ALL to avoid patch-boringssl.cmake (#17520)
We have to separate the code path starting 3.28 because
`FetchContent_Populate` is now deprecated and will be completely removed
in a future version.

Signed-off-by: Adrien Gallouët <angt@huggingface.co>
2025-11-26 15:15:21 +02:00
Adrien Gallouët e6923caaec ggml : fix ARM feature verification (#17519)
On arm64 with `cmake` version 3.31.6, the final feature verification fails:

    -- ARM detected flags: -mcpu=neoverse-v2+crc+sve2-aes+sve2-sha3+nossbs
    -- Performing Test GGML_MACHINE_SUPPORTS_dotprod
    -- Performing Test GGML_MACHINE_SUPPORTS_dotprod - Success
    -- Performing Test GGML_MACHINE_SUPPORTS_i8mm
    -- Performing Test GGML_MACHINE_SUPPORTS_i8mm - Success
    -- Performing Test GGML_MACHINE_SUPPORTS_sve
    -- Performing Test GGML_MACHINE_SUPPORTS_sve - Success
    -- Performing Test GGML_MACHINE_SUPPORTS_sme
    -- Performing Test GGML_MACHINE_SUPPORTS_sme - Failed
    -- Performing Test GGML_MACHINE_SUPPORTS_nosme
    -- Performing Test GGML_MACHINE_SUPPORTS_nosme - Success
    -- Checking for ARM features using flags:
    --   -U__ARM_FEATURE_SME
    --   -mcpu=neoverse-v2+crc+sve2-aes+sve2-sha3+nossbs+dotprod+i8mm+sve+nosme
    -- Performing Test HAVE_DOTPROD
    -- Performing Test HAVE_DOTPROD - Failed
    -- Performing Test HAVE_SVE
    -- Performing Test HAVE_SVE - Failed
    -- Performing Test HAVE_MATMUL_INT8
    -- Performing Test HAVE_MATMUL_INT8 - Failed
    -- Performing Test HAVE_FMA
    -- Performing Test HAVE_FMA - Success
    -- Performing Test HAVE_FP16_VECTOR_ARITHMETIC
    -- Performing Test HAVE_FP16_VECTOR_ARITHMETIC - Failed
    -- Performing Test HAVE_SME
    -- Performing Test HAVE_SME - Failed
    -- Adding CPU backend variant ggml-cpu: -U__ARM_FEATURE_SME;-mcpu=neoverse-v2+crc+sve2-aes+sve2-sha3+nossbs+dotprod+i8mm+sve+nosme

We need to explicitly replace `;` with spaces from the list to make
`CMAKE_REQUIRED_FLAGS` work correctly...

Signed-off-by: Adrien Gallouët <angt@huggingface.co>
2025-11-26 15:14:41 +02:00
Aleksander Grygier d0d7a88d13 chore: update webui build output 2025-11-26 14:14:15 +01:00
Aleksander Grygier 23a91cd257 refactor: Icons 2025-11-26 14:13:17 +01:00
Aleksander Grygier b1cf8bb814 refactor: Improve server properties management 2025-11-26 14:05:42 +01:00
Jiacheng (Jason) Chen 3e18dba9fd HIP: Patch failed testcase in WMMA-MMQ kernels for RDNA 4 (#17502)
* patch failed test case MUL_MAT(type_a=q4_0,type_b=f32,m=576,n=512,k=576,bs=[1,1],nr=[1,1],per=[0,1,2,3],k_v=0,o=1) for enabling WMMA on RDNA4

* Quick clean up on mma.cuh to add ggml_cuda_memcpy_1 back in for half2 and bfloat162
2025-11-26 11:18:48 +01:00
hipudding eeb5605de2 CANN: Add MROPE and IMROPE support (#17401)
* CANN: ROPE supports both MROPE and IMROPE.

1. Optimize the caching logic of rope_cache_init.
2. Add support for mRoPE and i-mRoPE.

Note that on Ascend 910B devices, it is necessary to disable FA
in CLIP and disable NZ-format conversion. These two issues are
still under investigation.

* Resolve review comments
2025-11-26 16:44:19 +08:00
o7si f3a848a3b1 chore: upgrade cpp-httplib from v0.27.0 to v0.28.0 (#17513) 2025-11-26 09:21:06 +02:00
Jeff Bolz b3b03a7baf vulkan: Implement GGML_OP_CUMSUM (#17479) 2025-11-26 07:08:10 +01:00