HappyZ happyz
happyz synced commits to refs/pull/19503/merge at happyz/llama.cpp from mirror 2026-04-03 19:01:36 -07:00
277ff5fff7 docker : bump cuda12 to 12.9.1 (#20920)
384c0076bc docs: Update build.md: HSA_OVERRIDE_GFX_VERSION clarification (#21331)
1f34806c44 jinja: coerce input for string-specific filters (#21370)
887535c33f ci: add more binary checks (#21349)
Compare 29 commits »
happyz synced commits to refs/pull/18892/merge at happyz/llama.cpp from mirror 2026-04-03 19:01:35 -07:00
277ff5fff7 docker : bump cuda12 to 12.9.1 (#20920)
384c0076bc docs: Update build.md: HSA_OVERRIDE_GFX_VERSION clarification (#21331)
1f34806c44 jinja: coerce input for string-specific filters (#21370)
887535c33f ci: add more binary checks (#21349)
Compare 30 commits »
happyz synced commits to refs/pull/18816/merge at happyz/llama.cpp from mirror 2026-04-03 19:01:34 -07:00
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
f1f793ad06 common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230)
Compare 19 commits »
happyz synced commits to refs/pull/18750/merge at happyz/llama.cpp from mirror 2026-04-03 19:01:33 -07:00
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
f1f793ad06 common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230)
Compare 84 commits »
happyz synced commits to refs/pull/18574/merge at happyz/llama.cpp from mirror 2026-04-03 19:01:32 -07:00
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
f1f793ad06 common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230)
Compare 26 commits »
happyz synced commits to refs/pull/18586/merge at happyz/llama.cpp from mirror 2026-04-03 19:01:32 -07:00
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
f1f793ad06 common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230)
Compare 22 commits »
happyz synced commits to refs/pull/18432/merge at happyz/llama.cpp from mirror 2026-04-03 19:01:31 -07:00
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
f1f793ad06 common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230)
af5c13841f common : fix tool call type detection for nullable and enum schemas (#21327)
Compare 32 commits »
happyz synced commits to refs/pull/17791/merge at happyz/llama.cpp from mirror 2026-04-03 19:01:30 -07:00
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
f1f793ad06 common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230)
Compare 16 commits »
happyz synced commits to refs/pull/17330/merge at happyz/llama.cpp from mirror 2026-04-03 19:01:29 -07:00
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
f1f793ad06 common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230)
af5c13841f common : fix tool call type detection for nullable and enum schemas (#21327)
Compare 67 commits »
happyz synced commits to refs/pull/17342/merge at happyz/llama.cpp from mirror 2026-04-03 19:01:29 -07:00
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
f1f793ad06 common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230)
Compare 20 commits »
happyz synced commits to refs/pull/16948/merge at happyz/llama.cpp from mirror 2026-04-03 19:01:28 -07:00
277ff5fff7 docker : bump cuda12 to 12.9.1 (#20920)
384c0076bc docs: Update build.md: HSA_OVERRIDE_GFX_VERSION clarification (#21331)
1f34806c44 jinja: coerce input for string-specific filters (#21370)
887535c33f ci: add more binary checks (#21349)
Compare 14 commits »
happyz synced commits to refs/pull/16915/merge at happyz/llama.cpp from mirror 2026-04-03 19:01:27 -07:00
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
f1f793ad06 common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230)
Compare 69 commits »
happyz synced commits to refs/pull/15805/merge at happyz/llama.cpp from mirror 2026-04-03 19:01:26 -07:00
f1f793ad06 common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230)
af5c13841f common : fix tool call type detection for nullable and enum schemas (#21327)
277ff5fff7 docker : bump cuda12 to 12.9.1 (#20920)
384c0076bc docs: Update build.md: HSA_OVERRIDE_GFX_VERSION clarification (#21331)
Compare 19 commits »
happyz synced commits to refs/pull/16753/merge at happyz/llama.cpp from mirror 2026-04-03 19:01:26 -07:00
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
f1f793ad06 common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230)
Compare 87 commits »
happyz synced commits to master at happyz/llama.cpp from mirror 2026-04-03 19:01:25 -07:00
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
f1f793ad06 common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230)
af5c13841f common : fix tool call type detection for nullable and enum schemas (#21327)
Compare 5 commits »
happyz synced commits to refs/pull/12243/merge at happyz/llama.cpp from mirror 2026-04-03 19:01:25 -07:00
d006858316 ggml-webgpu: move from parameter buffer pool to single buffer with offsets (#21278)
e439700992 ci: Add Windows Vulkan backend testing on Intel (#21292)
50e0ad08fb server: save and clear idle slots on new task (`--clear-idle`) (#20993)
f1f793ad06 common/parser: fix call ID detection (Mistral parser mostly) + atomicity for tag-json parsers (#21230)
Compare 71 commits »
happyz synced and deleted reference refs/tags/refs/pull/21357/merge at happyz/llama.cpp from mirror 2026-04-03 19:01:24 -07:00
happyz synced commits to cross-profiler at happyz/llama.cpp from mirror 2026-04-03 19:01:24 -07:00
d35b766819 Add missing op parameters to the profiler; add support for test-backend-ops to run performance tests with exactly the tensor shapes from the run
4cd2db730a docs, pass copy details
7ed999669f fix mul_mat_id stats, add throughput stat, add envvar trigger, add concurrent mode fix
31cb80be29 fix builds, integrate vulkan profiler, fix copy events, fix export
5ba2845db3 Fix more missing backend stuff (and Python errors)
Compare 12 commits »
happyz synced and deleted reference refs/tags/refs/pull/21292/merge at happyz/llama.cpp from mirror 2026-04-03 19:01:23 -07:00
happyz synced and deleted reference refs/tags/refs/pull/21327/merge at happyz/llama.cpp from mirror 2026-04-03 19:01:23 -07:00