HappyZ happyz
happyz synced new reference refs/tags/b8021 to happyz/llama.cpp from mirror 2026-02-13 06:02:35 -08:00
happyz synced commits to refs/tags/b8022 at happyz/llama.cpp from mirror 2026-02-13 06:02:35 -08:00
happyz synced new reference refs/tags/b8022 to happyz/llama.cpp from mirror 2026-02-13 06:02:35 -08:00
happyz synced new reference refs/tags/b8017 to happyz/llama.cpp from mirror 2026-02-13 06:02:34 -08:00
happyz synced commits to refs/tags/b8018 at happyz/llama.cpp from mirror 2026-02-13 06:02:34 -08:00
happyz synced new reference refs/tags/b8018 to happyz/llama.cpp from mirror 2026-02-13 06:02:34 -08:00
happyz synced commits to refs/tags/b8020 at happyz/llama.cpp from mirror 2026-02-13 06:02:34 -08:00
happyz synced commits to refs/pull/19558/merge at happyz/llama.cpp from mirror 2026-02-13 06:02:33 -08:00
cc2aa81513 Fix wrong memcpy length for block_interleave == 4 (#19575)
0e21991472 fix vulkan ggml_acc only works in 3d but not 4d (#19426)
b2ecc0cdb4 support --verbose-prompt (#19576)
5065da554e CUDA: loop over ne2*ne3 in case it overflows (#19538)
(14 commits total)
happyz synced commits to refs/pull/19566/head at happyz/llama.cpp from mirror 2026-02-13 06:02:33 -08:00
d812b6955b Update ggml/src/ggml-cuda/ggml-cuda.cu
happyz synced commits to refs/pull/19569/merge at happyz/llama.cpp from mirror 2026-02-13 06:02:33 -08:00
cc2aa81513 Fix wrong memcpy length for block_interleave == 4 (#19575)
0e21991472 fix vulkan ggml_acc only works in 3d but not 4d (#19426)
b2ecc0cdb4 support --verbose-prompt (#19576)
5065da554e CUDA: loop over ne2*ne3 in case it overflows (#19538)
(14 commits total)
happyz synced commits to refs/pull/19573/merge at happyz/llama.cpp from mirror 2026-02-13 06:02:33 -08:00
cc2aa81513 Fix wrong memcpy length for block_interleave == 4 (#19575)
0e21991472 fix vulkan ggml_acc only works in 3d but not 4d (#19426)
b2ecc0cdb4 support --verbose-prompt (#19576)
5065da554e CUDA: loop over ne2*ne3 in case it overflows (#19538)
(14 commits total)
happyz synced commits to refs/tags/b8017 at happyz/llama.cpp from mirror 2026-02-13 06:02:33 -08:00
happyz synced commits to refs/pull/19572/merge at happyz/llama.cpp from mirror 2026-02-13 06:02:33 -08:00
cc2aa81513 Fix wrong memcpy length for block_interleave == 4 (#19575)
0e21991472 fix vulkan ggml_acc only works in 3d but not 4d (#19426)
b2ecc0cdb4 support --verbose-prompt (#19576)
5065da554e CUDA: loop over ne2*ne3 in case it overflows (#19538)
(14 commits total)
happyz synced commits to refs/pull/19547/head at happyz/llama.cpp from mirror 2026-02-13 06:02:32 -08:00
1bb128d3e6 pre-downsample position embeddings during GGUF conversion for fixed input size
a565bbd1b4 simplified code; addressed reviews
(2 commits total)
happyz synced commits to refs/pull/19547/merge at happyz/llama.cpp from mirror 2026-02-13 06:02:32 -08:00
cc2aa81513 Fix wrong memcpy length for block_interleave == 4 (#19575)
0e21991472 fix vulkan ggml_acc only works in 3d but not 4d (#19426)
1bb128d3e6 pre-downsample position embeddings during GGUF conversion for fixed input size
b2ecc0cdb4 support --verbose-prompt (#19576)
(16 commits total)
happyz synced commits to refs/pull/19553/merge at happyz/llama.cpp from mirror 2026-02-13 06:02:32 -08:00
cc2aa81513 Fix wrong memcpy length for block_interleave == 4 (#19575)
0e21991472 fix vulkan ggml_acc only works in 3d but not 4d (#19426)
b2ecc0cdb4 support --verbose-prompt (#19576)
5065da554e CUDA: loop over ne2*ne3 in case it overflows (#19538)
(16 commits total)
happyz synced commits to refs/pull/19557/merge at happyz/llama.cpp from mirror 2026-02-13 06:02:32 -08:00
cc2aa81513 Fix wrong memcpy length for block_interleave == 4 (#19575)
0e21991472 fix vulkan ggml_acc only works in 3d but not 4d (#19426)
b2ecc0cdb4 support --verbose-prompt (#19576)
5065da554e CUDA: loop over ne2*ne3 in case it overflows (#19538)
(14 commits total)
happyz synced commits to refs/pull/19531/head at happyz/llama.cpp from mirror 2026-02-13 06:02:31 -08:00
a46782c1b7 Merge branch 'ggml-org:master' into Kimi-Linear
25224c8021 llama : remove deprecated codecvt (#19565)
2f5d8f8edc vendor : update BoringSSL to 0.20260211.0 (#19562)
bb96bfd361 memory : fix kv cache size for hybrid models (#19559)
0644baefde metal : improve concurrency (#19555)
(22 commits total)
happyz synced commits to refs/pull/19532/merge at happyz/llama.cpp from mirror 2026-02-13 06:02:31 -08:00
423cf0b26f docs : fix broken link and typo (#19560)
33a56f90a6 model : Kimi Linear fix conv state update (#19531)
25224c8021 llama : remove deprecated codecvt (#19565)
2f5d8f8edc vendor : update BoringSSL to 0.20260211.0 (#19562)
(10 commits total)
happyz synced commits to refs/pull/19535/merge at happyz/llama.cpp from mirror 2026-02-13 06:02:31 -08:00
423cf0b26f docs : fix broken link and typo (#19560)
33a56f90a6 model : Kimi Linear fix conv state update (#19531)
25224c8021 llama : remove deprecated codecvt (#19565)
2f5d8f8edc vendor : update BoringSSL to 0.20260211.0 (#19562)
(8 commits total)