HappyZ happyz
happyz synced commits to refs/pull/20609/merge at happyz/llama.cpp from mirror 2026-03-24 07:02:02 -07:00
312d870a89 common : replace wrap_for_generation with a prefix convenience function and fix gpt-oss (#20912)
7cadbfce10 hexagon: general DMA and Binary Op fixes for large strides (#20918)
1fb2290a51 Add codeowners for scripts/snapdragon and docs/snapdragon (#20915)
1772701f99 opencl: add q6_K gemm and gemv kernels for Adreno (#20089)
Compare 10 commits »
happyz synced commits to refs/pull/20624/merge at happyz/llama.cpp from mirror 2026-03-24 07:02:02 -07:00
2d2d9c2062 common : add a WARNING for HF cache migration (#20935)
92080b4396 metal : add FLOOR, CEIL, ROUND, TRUNC unary ops (#20930)
342d6125bc metal : add FA instantiations for HSK=512, HSV=512 (#20902)
c2e224d829 issues: add openvino backends (#20932)
Compare 11 commits »
happyz synced commits to refs/pull/20563/merge at happyz/llama.cpp from mirror 2026-03-24 07:02:01 -07:00
312d870a89 common : replace wrap_for_generation with a prefix convenience function and fix gpt-oss (#20912)
7cadbfce10 hexagon: general DMA and Binary Op fixes for large strides (#20918)
1fb2290a51 Add codeowners for scripts/snapdragon and docs/snapdragon (#20915)
1772701f99 opencl: add q6_K gemm and gemv kernels for Adreno (#20089)
Compare 7 commits »
happyz synced commits to refs/pull/20566/merge at happyz/llama.cpp from mirror 2026-03-24 07:02:01 -07:00
312d870a89 common : replace wrap_for_generation with a prefix convenience function and fix gpt-oss (#20912)
7cadbfce10 hexagon: general DMA and Binary Op fixes for large strides (#20918)
1fb2290a51 Add codeowners for scripts/snapdragon and docs/snapdragon (#20915)
1772701f99 opencl: add q6_K gemm and gemv kernels for Adreno (#20089)
Compare 7 commits »
happyz synced commits to refs/pull/20505/merge at happyz/llama.cpp from mirror 2026-03-24 07:02:01 -07:00
be89866087 Update convert_hf_to_gguf.py
6ebc1ea1d6 Merge branch 'ggml-org:master' into nvfp4-fix-qwen-conversions
30159b37cb Added input scale to loader and named _in_s
c9dc43333f readme : clarify MODEL_ENDPOINT usage (#20941)
Compare 15 commits »
happyz synced commits to refs/pull/20503/merge at happyz/llama.cpp from mirror 2026-03-24 07:02:01 -07:00
a94fdb090a WebUI: fix edit msg form textarea height (#20830)
c9dc43333f readme : clarify MODEL_ENDPOINT usage (#20941)
2d2d9c2062 common : add a WARNING for HF cache migration (#20935)
92080b4396 metal : add FLOOR, CEIL, ROUND, TRUNC unary ops (#20930)
Compare 11 commits »
happyz synced commits to refs/pull/20505/head at happyz/llama.cpp from mirror 2026-03-24 07:02:01 -07:00
be89866087 Update convert_hf_to_gguf.py
6ebc1ea1d6 Merge branch 'ggml-org:master' into nvfp4-fix-qwen-conversions
30159b37cb Added input scale to loader and named _in_s
c9dc43333f readme : clarify MODEL_ENDPOINT usage (#20941)
2d2d9c2062 common : add a WARNING for HF cache migration (#20935)
Compare 60 commits »
happyz synced commits to refs/pull/20456/merge at happyz/llama.cpp from mirror 2026-03-24 07:02:00 -07:00
8c7957ca33 common : add standard Hugging Face cache support (#20775)
e852eb4901 llama-fit: fix regex pattern for gate_up tensors (#20910)
312d870a89 common : replace wrap_for_generation with a prefix convenience function and fix gpt-oss (#20912)
7cadbfce10 hexagon: general DMA and Binary Op fixes for large strides (#20918)
Compare 8 commits »
happyz synced commits to refs/pull/20479/merge at happyz/llama.cpp from mirror 2026-03-24 07:02:00 -07:00
312d870a89 common : replace wrap_for_generation with a prefix convenience function and fix gpt-oss (#20912)
7cadbfce10 hexagon: general DMA and Binary Op fixes for large strides (#20918)
1fb2290a51 Add codeowners for scripts/snapdragon and docs/snapdragon (#20915)
1772701f99 opencl: add q6_K gemm and gemv kernels for Adreno (#20089)
Compare 29 commits »
happyz synced commits to refs/pull/20461/merge at happyz/llama.cpp from mirror 2026-03-24 07:02:00 -07:00
2d2d9c2062 common : add a WARNING for HF cache migration (#20935)
92080b4396 metal : add FLOOR, CEIL, ROUND, TRUNC unary ops (#20930)
342d6125bc metal : add FA instantiations for HSK=512, HSV=512 (#20902)
c2e224d829 issues: add openvino backends (#20932)
Compare 35 commits »
happyz synced commits to refs/pull/20472/merge at happyz/llama.cpp from mirror 2026-03-24 07:02:00 -07:00
c2e224d829 issues: add openvino backends (#20932)
8c7957ca33 common : add standard Hugging Face cache support (#20775)
e852eb4901 llama-fit: fix regex pattern for gate_up tensors (#20910)
312d870a89 common : replace wrap_for_generation with a prefix convenience function and fix gpt-oss (#20912)
Compare 8 commits »
happyz synced commits to refs/pull/20435/merge at happyz/llama.cpp from mirror 2026-03-24 07:01:59 -07:00
c9dc43333f readme : clarify MODEL_ENDPOINT usage (#20941)
2d2d9c2062 common : add a WARNING for HF cache migration (#20935)
92080b4396 metal : add FLOOR, CEIL, ROUND, TRUNC unary ops (#20930)
342d6125bc metal : add FA instantiations for HSK=512, HSV=512 (#20902)
Compare 34 commits »
happyz synced commits to refs/pull/20451/merge at happyz/llama.cpp from mirror 2026-03-24 07:01:59 -07:00
a94fdb090a WebUI: fix edit msg form textarea height (#20830)
c9dc43333f readme : clarify MODEL_ENDPOINT usage (#20941)
2d2d9c2062 common : add a WARNING for HF cache migration (#20935)
92080b4396 metal : add FLOOR, CEIL, ROUND, TRUNC unary ops (#20930)
Compare 13 commits »
happyz synced commits to refs/pull/20388/merge at happyz/llama.cpp from mirror 2026-03-24 07:01:58 -07:00
2d2d9c2062 common : add a WARNING for HF cache migration (#20935)
92080b4396 metal : add FLOOR, CEIL, ROUND, TRUNC unary ops (#20930)
342d6125bc metal : add FA instantiations for HSK=512, HSV=512 (#20902)
c2e224d829 issues: add openvino backends (#20932)
Compare 26 commits »
happyz synced commits to refs/pull/20394/merge at happyz/llama.cpp from mirror 2026-03-24 07:01:58 -07:00
e852eb4901 llama-fit: fix regex pattern for gate_up tensors (#20910)
312d870a89 common : replace wrap_for_generation with a prefix convenience function and fix gpt-oss (#20912)
7cadbfce10 hexagon: general DMA and Binary Op fixes for large strides (#20918)
1fb2290a51 Add codeowners for scripts/snapdragon and docs/snapdragon (#20915)
Compare 9 commits »
happyz synced commits to refs/pull/20346/merge at happyz/llama.cpp from mirror 2026-03-24 07:01:57 -07:00
312d870a89 common : replace wrap_for_generation with a prefix convenience function and fix gpt-oss (#20912)
7cadbfce10 hexagon: general DMA and Binary Op fixes for large strides (#20918)
1fb2290a51 Add codeowners for scripts/snapdragon and docs/snapdragon (#20915)
1772701f99 opencl: add q6_K gemm and gemv kernels for Adreno (#20089)
Compare 10 commits »
happyz synced commits to refs/pull/20112/merge at happyz/llama.cpp from mirror 2026-03-24 07:01:57 -07:00
9c00aab225 Merge branch 'ggml-org:master' into llama-quant-refactor
3fc6f1aed1 ggml-backend: re-enable graph reuse with pipeline parallelism (#20927)
29771a0a4c vendor : update cpp-httplib to 0.39.0 (#20933)
42ebce3beb common : fix get_gguf_split_info (#20946)
Compare 22 commits »
happyz synced commits to refs/pull/20158/merge at happyz/llama.cpp from mirror 2026-03-24 07:01:57 -07:00
e852eb4901 llama-fit: fix regex pattern for gate_up tensors (#20910)
312d870a89 common : replace wrap_for_generation with a prefix convenience function and fix gpt-oss (#20912)
7cadbfce10 hexagon: general DMA and Binary Op fixes for large strides (#20918)
1fb2290a51 Add codeowners for scripts/snapdragon and docs/snapdragon (#20915)
Compare 7 commits »
happyz synced commits to refs/pull/20172/merge at happyz/llama.cpp from mirror 2026-03-24 07:01:57 -07:00
c9dc43333f readme : clarify MODEL_ENDPOINT usage (#20941)
2d2d9c2062 common : add a WARNING for HF cache migration (#20935)
92080b4396 metal : add FLOOR, CEIL, ROUND, TRUNC unary ops (#20930)
342d6125bc metal : add FA instantiations for HSK=512, HSV=512 (#20902)
Compare 18 commits »
happyz synced commits to refs/pull/20275/merge at happyz/llama.cpp from mirror 2026-03-24 07:01:57 -07:00
312d870a89 common : replace wrap_for_generation with a prefix convenience function and fix gpt-oss (#20912)
7cadbfce10 hexagon: general DMA and Binary Op fixes for large strides (#20918)
1fb2290a51 Add codeowners for scripts/snapdragon and docs/snapdragon (#20915)
1772701f99 opencl: add q6_K gemm and gemv kernels for Adreno (#20089)
Compare 7 commits »