HappyZ happyz
happyz synced commits to refs/pull/19530/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:31 -08:00
05fa625eac convert : add JoyAI-LLM-Flash (#19651)
d612901116 perplexity: add proper batching (#19661)
cceb1b4e33 common : inline functions (#18639)
d23a55997d ggml : make `ggml_is_view` as API (#19539)
Compare 9 commits »
happyz synced commits to refs/pull/19536/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:31 -08:00
05fa625eac convert : add JoyAI-LLM-Flash (#19651)
d612901116 perplexity: add proper batching (#19661)
cceb1b4e33 common : inline functions (#18639)
d23a55997d ggml : make `ggml_is_view` as API (#19539)
Compare 11 commits »
happyz synced commits to refs/pull/19558/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:31 -08:00
d612901116 perplexity: add proper batching (#19661)
cceb1b4e33 common : inline functions (#18639)
d23a55997d ggml : make `ggml_is_view` as API (#19539)
5f28c53d11 model: Add support for Tiny Aya Models (#19611)
Compare 6 commits »
happyz synced commits to refs/pull/19572/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:31 -08:00
d612901116 perplexity: add proper batching (#19661)
cceb1b4e33 common : inline functions (#18639)
d23a55997d ggml : make `ggml_is_view` as API (#19539)
5f28c53d11 model: Add support for Tiny Aya Models (#19611)
Compare 8 commits »
happyz synced commits to refs/pull/19514/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:30 -08:00
05fa625eac convert : add JoyAI-LLM-Flash (#19651)
d612901116 perplexity: add proper batching (#19661)
cceb1b4e33 common : inline functions (#18639)
d23a55997d ggml : make `ggml_is_view` as API (#19539)
Compare 14 commits »
happyz synced commits to refs/pull/19526/head at happyz/llama.cpp from mirror 2026-02-16 18:01:30 -08:00
1cd57c81d8 Merge branch 'ggml-org:master' into llama-quantize-dry-run
d612901116 perplexity: add proper batching (#19661)
cceb1b4e33 common : inline functions (#18639)
d23a55997d ggml : make `ggml_is_view` as API (#19539)
5f28c53d11 model: Add support for Tiny Aya Models (#19611)
Compare 22 commits »
happyz synced commits to refs/pull/19526/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:30 -08:00
05fa625eac convert : add JoyAI-LLM-Flash (#19651)
1cd57c81d8 Merge branch 'ggml-org:master' into llama-quantize-dry-run
d612901116 perplexity: add proper batching (#19661)
cceb1b4e33 common : inline functions (#18639)
Compare 8 commits »
happyz synced commits to refs/pull/19527/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:30 -08:00
d612901116 perplexity: add proper batching (#19661)
cceb1b4e33 common : inline functions (#18639)
d23a55997d ggml : make `ggml_is_view` as API (#19539)
5f28c53d11 model: Add support for Tiny Aya Models (#19611)
Compare 8 commits »
happyz synced commits to refs/pull/19493/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:29 -08:00
05fa625eac convert : add JoyAI-LLM-Flash (#19651)
0fa66c2774 chore: update webui build output
2f31b2da63 server : log levels
2ee85994e0 server : rename spec vars
Compare 16 commits »
happyz synced commits to refs/pull/19503/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:29 -08:00
05fa625eac convert : add JoyAI-LLM-Flash (#19651)
d612901116 perplexity: add proper batching (#19661)
cceb1b4e33 common : inline functions (#18639)
d23a55997d ggml : make `ggml_is_view` as API (#19539)
Compare 7 commits »
happyz synced commits to refs/pull/19504/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:29 -08:00
05fa625eac convert : add JoyAI-LLM-Flash (#19651)
d612901116 perplexity: add proper batching (#19661)
cceb1b4e33 common : inline functions (#18639)
d23a55997d ggml : make `ggml_is_view` as API (#19539)
Compare 7 commits »
happyz synced commits to refs/pull/19509/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:29 -08:00
05fa625eac convert : add JoyAI-LLM-Flash (#19651)
d612901116 perplexity: add proper batching (#19661)
cceb1b4e33 common : inline functions (#18639)
d23a55997d ggml : make `ggml_is_view` as API (#19539)
Compare 8 commits »
happyz synced commits to refs/pull/19434/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:28 -08:00
05fa625eac convert : add JoyAI-LLM-Flash (#19651)
d612901116 perplexity: add proper batching (#19661)
cceb1b4e33 common : inline functions (#18639)
d23a55997d ggml : make `ggml_is_view` as API (#19539)
Compare 11 commits »
happyz synced commits to refs/pull/19440/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:28 -08:00
05fa625eac convert : add JoyAI-LLM-Flash (#19651)
d612901116 perplexity: add proper batching (#19661)
cceb1b4e33 common : inline functions (#18639)
d23a55997d ggml : make `ggml_is_view` as API (#19539)
Compare 11 commits »
happyz synced commits to refs/pull/19478/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:28 -08:00
05fa625eac convert : add JoyAI-LLM-Flash (#19651)
d612901116 perplexity: add proper batching (#19661)
cceb1b4e33 common : inline functions (#18639)
d23a55997d ggml : make `ggml_is_view` as API (#19539)
Compare 11 commits »
happyz synced commits to refs/pull/19493/head at happyz/llama.cpp from mirror 2026-02-16 18:01:28 -08:00
0fa66c2774 chore: update webui build output
2f31b2da63 server : log levels
2ee85994e0 server : rename spec vars
c9fc6af71d server : fix draft check with checkpoints
07747a0836 server : speculative decoding using checkpoints
Compare 100 commits »
happyz synced commits to refs/pull/19361/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:27 -08:00
b7bab70acd server: to_json_oaicompat cached_tokens
f79c5472a9 tests : fix fetch_server_test_models.py
05fa625eac convert : add JoyAI-LLM-Flash (#19651)
d612901116 perplexity: add proper batching (#19661)
Compare 29 commits »
happyz synced commits to refs/pull/19378/head at happyz/llama.cpp from mirror 2026-02-16 18:01:27 -08:00
f0198ef6fc Merge pull request #6 from gaugarg-nv/get_host_buffer_type
aa8b62105c Support device-specific host buffer types if all underlying backends expose the same type. This allows using pinned memory instead of pageable memory for CUDA.
Compare 2 commits »
happyz synced commits to refs/pull/19378/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:27 -08:00
05fa625eac convert : add JoyAI-LLM-Flash (#19651)
d612901116 perplexity: add proper batching (#19661)
cceb1b4e33 common : inline functions (#18639)
d23a55997d ggml : make `ggml_is_view` as API (#19539)
Compare 9 commits »
happyz synced commits to refs/pull/19409/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:27 -08:00
d612901116 perplexity: add proper batching (#19661)
cceb1b4e33 common : inline functions (#18639)
d23a55997d ggml : make `ggml_is_view` as API (#19539)
5f28c53d11 model: Add support for Tiny Aya Models (#19611)
Compare 8 commits »