HappyZ happyz
happyz synced commits to refs/pull/19743/merge at happyz/llama.cpp from mirror 2026-04-04 07:02:26 -07:00
b7ad48ebda llama: add custom newline split for Gemma 4 (#21406)
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
Compare 21 commits »
happyz synced commits to refs/pull/19441/merge at happyz/llama.cpp from mirror 2026-04-04 07:02:25 -07:00
650bf14eb9 llama-model: read final_logit_softcapping for Gemma 4 (#21390)
b7ad48ebda llama: add custom newline split for Gemma 4 (#21406)
Compare 3 commits »
happyz synced commits to refs/pull/19691/merge at happyz/llama.cpp from mirror 2026-04-04 07:02:25 -07:00
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
f1f793ad06 common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230)
Compare 26 commits »
happyz synced commits to refs/pull/19493/merge at happyz/llama.cpp from mirror 2026-04-04 07:02:25 -07:00
b7ad48ebda llama: add custom newline split for Gemma 4 (#21406)
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
Compare 7 commits »
happyz synced commits to refs/pull/19434/merge at happyz/llama.cpp from mirror 2026-04-04 07:02:24 -07:00
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
f1f793ad06 common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230)
Compare 22 commits »
happyz synced commits to refs/pull/19341/merge at happyz/llama.cpp from mirror 2026-04-04 07:02:24 -07:00
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
f1f793ad06 common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230)
Compare 47 commits »
happyz synced commits to refs/pull/19294/merge at happyz/llama.cpp from mirror 2026-04-04 07:02:24 -07:00
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
f1f793ad06 common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230)
Compare 22 commits »
happyz synced commits to refs/pull/19254/merge at happyz/llama.cpp from mirror 2026-04-04 07:02:24 -07:00
b7ad48ebda llama: add custom newline split for Gemma 4 (#21406)
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
Compare 12 commits »
happyz synced commits to refs/pull/19196/merge at happyz/llama.cpp from mirror 2026-04-04 07:02:23 -07:00
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
f1f793ad06 common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230)
Compare 72 commits »
happyz synced commits to refs/pull/19171/merge at happyz/llama.cpp from mirror 2026-04-04 07:02:23 -07:00
b7ad48ebda llama: add custom newline split for Gemma 4 (#21406)
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
Compare 33 commits »
happyz synced commits to refs/pull/18923/merge at happyz/llama.cpp from mirror 2026-04-04 07:02:23 -07:00
b7ad48ebda llama: add custom newline split for Gemma 4 (#21406)
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
Compare 82 commits »
happyz synced commits to refs/pull/18432/merge at happyz/llama.cpp from mirror 2026-04-04 07:02:22 -07:00
b7ad48ebda llama: add custom newline split for Gemma 4 (#21406)
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
Compare 3 commits »
happyz synced commits to refs/pull/18742/merge at happyz/llama.cpp from mirror 2026-04-04 07:02:22 -07:00
b7ad48ebda llama: add custom newline split for Gemma 4 (#21406)
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
Compare 125 commits »
happyz synced commits to refs/pull/18588/merge at happyz/llama.cpp from mirror 2026-04-04 07:02:22 -07:00
b7ad48ebda llama: add custom newline split for Gemma 4 (#21406)
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
Compare 15 commits »
happyz synced commits to refs/pull/18465/merge at happyz/llama.cpp from mirror 2026-04-04 07:02:22 -07:00
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
f1f793ad06 common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230)
Compare 58 commits »
happyz synced commits to refs/pull/18150/merge at happyz/llama.cpp from mirror 2026-04-04 07:02:21 -07:00
650bf14eb9 llama-model: read final_logit_softcapping for Gemma 4 (#21390)
b7ad48ebda llama: add custom newline split for Gemma 4 (#21406)
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
Compare 13 commits »
happyz synced commits to refs/pull/17791/merge at happyz/llama.cpp from mirror 2026-04-04 07:02:21 -07:00
b7ad48ebda llama: add custom newline split for Gemma 4 (#21406)
Compare 2 commits »
happyz synced commits to refs/pull/18373/merge at happyz/llama.cpp from mirror 2026-04-04 07:02:21 -07:00
b7ad48ebda llama: add custom newline split for Gemma 4 (#21406)
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
Compare 23 commits »
happyz synced commits to refs/pull/17330/merge at happyz/llama.cpp from mirror 2026-04-04 07:02:20 -07:00
d01f6274c0 common : respect specified tag, only fallback when tag is empty (#21413)
650bf14eb9 llama-model: read final_logit_softcapping for Gemma 4 (#21390)
b7ad48ebda llama: add custom newline split for Gemma 4 (#21406)
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
Compare 5 commits »
happyz synced commits to refs/pull/16948/merge at happyz/llama.cpp from mirror 2026-04-04 07:02:20 -07:00
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
f1f793ad06 common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230)
Compare 6 commits »