Default Branch

c1366056f6 · android: routine maintenance - Dec 2025 (#18338) · Updated 2025-12-29 05:51:13 -08:00

Branches

c9e7cbb08b · safer jinja `llama_chat_templates` struct · Updated 2025-01-20 07:58:29 -08:00

3070
34

90a0349349 · recommended way to check if the version is 0.3, as requested by ngxson · Updated 2025-01-19 05:43:59 -08:00

3068
2

ba421dd04e · gguf-test: tensor data comparison · Updated 2025-01-18 00:49:47 -08:00

3070
7

492eaad571 · ci : change python3 -> python · Updated 2025-01-15 06:18:56 -08:00

3084
1

0cf9a06799 · vocab : minor [no ci] · Updated 2025-01-14 00:36:28 -08:00

3093
2

a97b3621cf · ggml : ggml_backend_graph_copy -> ggml_backend_graph_copy_state · Updated 2025-01-12 07:57:51 -08:00

3108
15

9af90481d0 · Vulkan: Add renderdoc tracing support · Updated 2025-01-12 05:47:36 -08:00

3110
1

fbddb26250 · ggml-cuda : use i and j instead of i0 and i in vec_dot_tq2_0_q8_1 · Updated 2025-01-11 18:06:49 -08:00

3117
7

9605c5fb28 · cmake : remove explicit _XOPEN_SOURCE · Updated 2025-01-06 03:02:48 -08:00

3162
2

aa014d7e89 · Use mutex instead of atomics for vk_instance counters · Updated 2024-12-29 21:14:58 -08:00

3175
2

fe9235d795 · Force max subgroup size for coopmat shaders · Updated 2024-12-17 23:26:27 -08:00

3221
1

4fbb801a9d · ggml : update ggml_backend_cpu_device_supports_op · Updated 2024-12-17 08:09:02 -08:00

3231
3

3e92f4ecbe · cont [no ci] · Updated 2024-12-15 02:36:03 -08:00

3243
2

7e9208e408 · scripts : change build path to "build-bench" for compare-commits.sh · Updated 2024-12-15 01:47:30 -08:00

3243
1

fb18934a97 · gguf-py : bump version to 0.11.0 · Updated 2024-12-11 13:13:31 -08:00

3264
0
Included

4f3a7e279b · Force max subgroup size for coopmat shaders · Updated 2024-12-10 12:27:04 -08:00

3272
2

b8d1b1a5e1 · server : fix infill prompt format · Updated 2024-12-08 12:12:11 -08:00

3282
1

a6648b9df7 · server : chunked prefill support · Updated 2024-12-07 23:48:18 -08:00

3286
1

a8046c888a · use calloc instead of malloc · Updated 2024-12-04 08:24:35 -08:00

3317
3

81611bef72 · server : add tests · Updated 2024-12-04 03:11:26 -08:00

3317
3