HappyZ (happyz)
happyz synced commits to refs/pull/20394/merge at happyz/llama.cpp from mirror 2026-04-01 07:02:27 -07:00
6b949d1078 sycl : support nvfp4 type in mul_mat (#21227)
84f82e846c ggml-cuda: Add generic NVFP4 MMQ kernel (#21074)
e1cb817483 memory: respect unified KV cache in hybrid memory for eval tasks (#21224)
88d5f8ffc3 CUDA/HIP: Fix kernel selection for mmvq mmid kernel to align host selection with device launch bounds (#21238)
Compare 10 commits »
happyz synced commits to refs/pull/20454/merge at happyz/llama.cpp from mirror 2026-04-01 07:02:27 -07:00
6b949d1078 sycl : support nvfp4 type in mul_mat (#21227)
84f82e846c ggml-cuda: Add generic NVFP4 MMQ kernel (#21074)
e1cb817483 memory: respect unified KV cache in hybrid memory for eval tasks (#21224)
88d5f8ffc3 CUDA/HIP: Fix kernel selection for mmvq mmid kernel to align host selection with device launch bounds (#21238)
Compare 10 commits »
happyz synced commits to refs/pull/20456/merge at happyz/llama.cpp from mirror 2026-04-01 07:02:27 -07:00
6b949d1078 sycl : support nvfp4 type in mul_mat (#21227)
84f82e846c ggml-cuda: Add generic NVFP4 MMQ kernel (#21074)
e1cb817483 memory: respect unified KV cache in hybrid memory for eval tasks (#21224)
88d5f8ffc3 CUDA/HIP: Fix kernel selection for mmvq mmid kernel to align host selection with device launch bounds (#21238)
Compare 11 commits »
happyz synced commits to refs/pull/20487/merge at happyz/llama.cpp from mirror 2026-04-01 07:02:27 -07:00
825eb91a66 ggml-webgpu: port all AOT operators to JIT (#20728)
0fcb3760b2 fix: Use lower-case proxy headers naming (#21235)
6307ec07d3 common : cleanup logs and modernize the progress bar (#21215)
632219af73 CANN: fix multi-thread set_tensor race conditions (#20151)
Compare 35 commits »
happyz synced commits to refs/pull/20275/merge at happyz/llama.cpp from mirror 2026-04-01 07:02:26 -07:00
6b949d1078 sycl : support nvfp4 type in mul_mat (#21227)
84f82e846c ggml-cuda: Add generic NVFP4 MMQ kernel (#21074)
e1cb817483 memory: respect unified KV cache in hybrid memory for eval tasks (#21224)
88d5f8ffc3 CUDA/HIP: Fix kernel selection for mmvq mmid kernel to align host selection with device launch bounds (#21238)
Compare 10 commits »
happyz synced commits to refs/pull/20269/merge at happyz/llama.cpp from mirror 2026-04-01 07:02:26 -07:00
825eb91a66 ggml-webgpu: port all AOT operators to JIT (#20728)
0fcb3760b2 fix: Use lower-case proxy headers naming (#21235)
6307ec07d3 common : cleanup logs and modernize the progress bar (#21215)
632219af73 CANN: fix multi-thread set_tensor race conditions (#20151)
Compare 18 commits »
happyz synced commits to refs/pull/20242/merge at happyz/llama.cpp from mirror 2026-04-01 07:02:25 -07:00
88d5f8ffc3 CUDA/HIP: Fix kernel selection for mmvq mmid kernel to align host selection with device launch bounds (#21238)
d43375ff7f ggml : fix RWKV ops thread assignment (#21226)
2b86e5cae6 ggml-cpu: fix fallback for RVV kernels without zvfh (#21157)
88458164c7 CUDA: Add Flash Attention Support for Head Dimension 512 (#20998)
Compare 41 commits »
happyz synced commits to refs/pull/20075/merge at happyz/llama.cpp from mirror 2026-04-01 07:02:24 -07:00
6b949d1078 sycl : support nvfp4 type in mul_mat (#21227)
84f82e846c ggml-cuda: Add generic NVFP4 MMQ kernel (#21074)
e1cb817483 memory: respect unified KV cache in hybrid memory for eval tasks (#21224)
88d5f8ffc3 CUDA/HIP: Fix kernel selection for mmvq mmid kernel to align host selection with device launch bounds (#21238)
Compare 10 commits »
happyz synced commits to refs/pull/20238/merge at happyz/llama.cpp from mirror 2026-04-01 07:02:24 -07:00
6b949d1078 sycl : support nvfp4 type in mul_mat (#21227)
84f82e846c ggml-cuda: Add generic NVFP4 MMQ kernel (#21074)
e1cb817483 memory: respect unified KV cache in hybrid memory for eval tasks (#21224)
88d5f8ffc3 CUDA/HIP: Fix kernel selection for mmvq mmid kernel to align host selection with device launch bounds (#21238)
Compare 10 commits »
happyz synced commits to refs/pull/20086/merge at happyz/llama.cpp from mirror 2026-04-01 07:02:24 -07:00
d43375ff7f ggml : fix RWKV ops thread assignment (#21226)
2b86e5cae6 ggml-cpu: fix fallback for RVV kernels without zvfh (#21157)
88458164c7 CUDA: Add Flash Attention Support for Head Dimension 512 (#20998)
4951250235 llama : refactor llama_model_quantize_params to expose a pure C interface (#20346)
Compare 7 commits »
happyz synced commits to refs/pull/20112/merge at happyz/llama.cpp from mirror 2026-04-01 07:02:24 -07:00
4951250235 llama : refactor llama_model_quantize_params to expose a pure C interface (#20346)
82764c341a ggml webgpu: quantized buffers to u32 + wider browser/device support (#21046)
825eb91a66 ggml-webgpu: port all AOT operators to JIT (#20728)
0fcb3760b2 fix: Use lower-case proxy headers naming (#21235)
Compare 5 commits »
happyz synced commits to refs/pull/20064/merge at happyz/llama.cpp from mirror 2026-04-01 07:02:23 -07:00
825eb91a66 ggml-webgpu: port all AOT operators to JIT (#20728)
0fcb3760b2 fix: Use lower-case proxy headers naming (#21235)
6307ec07d3 common : cleanup logs and modernize the progress bar (#21215)
632219af73 CANN: fix multi-thread set_tensor race conditions (#20151)
Compare 35 commits »
happyz synced commits to refs/pull/19855/merge at happyz/llama.cpp from mirror 2026-04-01 07:02:23 -07:00
6b949d1078 sycl : support nvfp4 type in mul_mat (#21227)
84f82e846c ggml-cuda: Add generic NVFP4 MMQ kernel (#21074)
e1cb817483 memory: respect unified KV cache in hybrid memory for eval tasks (#21224)
88d5f8ffc3 CUDA/HIP: Fix kernel selection for mmvq mmid kernel to align host selection with device launch bounds (#21238)
Compare 10 commits »
happyz synced commits to refs/pull/19938/merge at happyz/llama.cpp from mirror 2026-04-01 07:02:23 -07:00
4951250235 llama : refactor llama_model_quantize_params to expose a pure C interface (#20346)
82764c341a ggml webgpu: quantized buffers to u32 + wider browser/device support (#21046)
825eb91a66 ggml-webgpu: port all AOT operators to JIT (#20728)
0fcb3760b2 fix: Use lower-case proxy headers naming (#21235)
Compare 31 commits »
happyz synced commits to refs/pull/20009/merge at happyz/llama.cpp from mirror 2026-04-01 07:02:23 -07:00
825eb91a66 ggml-webgpu: port all AOT operators to JIT (#20728)
0fcb3760b2 fix: Use lower-case proxy headers naming (#21235)
6307ec07d3 common : cleanup logs and modernize the progress bar (#21215)
632219af73 CANN: fix multi-thread set_tensor race conditions (#20151)
Compare 14 commits »
happyz synced commits to refs/pull/20062/merge at happyz/llama.cpp from mirror 2026-04-01 07:02:23 -07:00
6b949d1078 sycl : support nvfp4 type in mul_mat (#21227)
84f82e846c ggml-cuda: Add generic NVFP4 MMQ kernel (#21074)
e1cb817483 memory: respect unified KV cache in hybrid memory for eval tasks (#21224)
88d5f8ffc3 CUDA/HIP: Fix kernel selection for mmvq mmid kernel to align host selection with device launch bounds (#21238)
Compare 10 commits »
happyz synced commits to refs/pull/19671/merge at happyz/llama.cpp from mirror 2026-04-01 07:02:22 -07:00
84f82e846c ggml-cuda: Add generic NVFP4 MMQ kernel (#21074)
e1cb817483 memory: respect unified KV cache in hybrid memory for eval tasks (#21224)
88d5f8ffc3 CUDA/HIP: Fix kernel selection for mmvq mmid kernel to align host selection with device launch bounds (#21238)
d43375ff7f ggml : fix RWKV ops thread assignment (#21226)
Compare 38 commits »
happyz synced commits to refs/pull/19763/merge at happyz/llama.cpp from mirror 2026-04-01 07:02:22 -07:00
825eb91a66 ggml-webgpu: port all AOT operators to JIT (#20728)
0fcb3760b2 fix: Use lower-case proxy headers naming (#21235)
6307ec07d3 common : cleanup logs and modernize the progress bar (#21215)
632219af73 CANN: fix multi-thread set_tensor race conditions (#20151)
Compare 23 commits »
happyz synced commits to refs/pull/19743/merge at happyz/llama.cpp from mirror 2026-04-01 07:02:22 -07:00
4951250235 llama : refactor llama_model_quantize_params to expose a pure C interface (#20346)
82764c341a ggml webgpu: quantized buffers to u32 + wider browser/device support (#21046)
825eb91a66 ggml-webgpu: port all AOT operators to JIT (#20728)
0fcb3760b2 fix: Use lower-case proxy headers naming (#21235)
Compare 5 commits »
happyz synced commits to refs/pull/19755/merge at happyz/llama.cpp from mirror 2026-04-01 07:02:22 -07:00
88458164c7 CUDA: Add Flash Attention Support for Head Dimension 512 (#20998)
4951250235 llama : refactor llama_model_quantize_params to expose a pure C interface (#20346)
82764c341a ggml webgpu: quantized buffers to u32 + wider browser/device support (#21046)
825eb91a66 ggml-webgpu: port all AOT operators to JIT (#20728)
Compare 19 commits »