HappyZ

happyz synced commits to refs/pull/19530/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:31 -08:00

ece93a4ce3 Merge e29c48019b into 05fa625eac

05fa625eac convert : add JoyAI-LLM-Flash (#19651)

d612901116 perplexity: add proper batching (#19661)

cceb1b4e33 common : inline functions (#18639)

d23a55997d ggml : make `ggml_is_view` as API (#19539)

Compare 9 commits »

happyz synced commits to refs/pull/19536/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:31 -08:00

a2070b3a9d Merge 233f5ab82d into 05fa625eac

05fa625eac convert : add JoyAI-LLM-Flash (#19651)

d612901116 perplexity: add proper batching (#19661)

cceb1b4e33 common : inline functions (#18639)

d23a55997d ggml : make `ggml_is_view` as API (#19539)

Compare 11 commits »

happyz synced commits to refs/pull/19558/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:31 -08:00

5da01d287b Merge c3f8de0e0c into d612901116

d612901116 perplexity: add proper batching (#19661)

cceb1b4e33 common : inline functions (#18639)

d23a55997d ggml : make `ggml_is_view` as API (#19539)

5f28c53d11 model: Add support for Tiny Aya Models (#19611)

Compare 6 commits »

happyz synced commits to refs/pull/19572/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:31 -08:00

23fecccf84 Merge db4a5a84fc into d612901116

d612901116 perplexity: add proper batching (#19661)

cceb1b4e33 common : inline functions (#18639)

d23a55997d ggml : make `ggml_is_view` as API (#19539)

5f28c53d11 model: Add support for Tiny Aya Models (#19611)

Compare 8 commits »

happyz synced commits to refs/pull/19514/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:30 -08:00

ccebad6fe8 Merge d66cb47a17 into 05fa625eac

05fa625eac convert : add JoyAI-LLM-Flash (#19651)

d612901116 perplexity: add proper batching (#19661)

cceb1b4e33 common : inline functions (#18639)

d23a55997d ggml : make `ggml_is_view` as API (#19539)

Compare 14 commits »

happyz synced commits to refs/pull/19526/head at happyz/llama.cpp from mirror 2026-02-16 18:01:30 -08:00

1cd57c81d8 Merge branch 'ggml-org:master' into llama-quantize-dry-run

d612901116 perplexity: add proper batching (#19661)

cceb1b4e33 common : inline functions (#18639)

d23a55997d ggml : make `ggml_is_view` as API (#19539)

5f28c53d11 model: Add support for Tiny Aya Models (#19611)

Compare 22 commits »

happyz synced commits to refs/pull/19526/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:30 -08:00

511f29cbc8 Merge 1cd57c81d8 into 05fa625eac

05fa625eac convert : add JoyAI-LLM-Flash (#19651)

1cd57c81d8 Merge branch 'ggml-org:master' into llama-quantize-dry-run

d612901116 perplexity: add proper batching (#19661)

cceb1b4e33 common : inline functions (#18639)

Compare 8 commits »

happyz synced commits to refs/pull/19527/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:30 -08:00

6b63456bdb Merge e459796110 into d612901116

d612901116 perplexity: add proper batching (#19661)

cceb1b4e33 common : inline functions (#18639)

d23a55997d ggml : make `ggml_is_view` as API (#19539)

5f28c53d11 model: Add support for Tiny Aya Models (#19611)

Compare 8 commits »

happyz synced commits to refs/pull/19493/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:29 -08:00

7d9cda3665 Merge 0fa66c2774 into 05fa625eac

05fa625eac convert : add JoyAI-LLM-Flash (#19651)

0fa66c2774 chore: update webui build output

2f31b2da63 server : log levels

2ee85994e0 server : rename spec vars

Compare 16 commits »

happyz synced commits to refs/pull/19503/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:29 -08:00

ceaf729db0 Merge 3637bf51cd into 05fa625eac

05fa625eac convert : add JoyAI-LLM-Flash (#19651)

d612901116 perplexity: add proper batching (#19661)

cceb1b4e33 common : inline functions (#18639)

d23a55997d ggml : make `ggml_is_view` as API (#19539)

Compare 7 commits »

happyz synced commits to refs/pull/19504/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:29 -08:00

2332005ae6 Merge 3db6e5ef22 into 05fa625eac

05fa625eac convert : add JoyAI-LLM-Flash (#19651)

d612901116 perplexity: add proper batching (#19661)

cceb1b4e33 common : inline functions (#18639)

d23a55997d ggml : make `ggml_is_view` as API (#19539)

Compare 7 commits »

happyz synced commits to refs/pull/19509/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:29 -08:00

28daa074ea Merge feb3eef27e into 05fa625eac

05fa625eac convert : add JoyAI-LLM-Flash (#19651)

d612901116 perplexity: add proper batching (#19661)

cceb1b4e33 common : inline functions (#18639)

d23a55997d ggml : make `ggml_is_view` as API (#19539)

Compare 8 commits »

happyz synced commits to refs/pull/19434/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:28 -08:00

1607aacda4 Merge 05dfc18d55 into 05fa625eac

05fa625eac convert : add JoyAI-LLM-Flash (#19651)

d612901116 perplexity: add proper batching (#19661)

cceb1b4e33 common : inline functions (#18639)

d23a55997d ggml : make `ggml_is_view` as API (#19539)

Compare 11 commits »

happyz synced commits to refs/pull/19440/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:28 -08:00

601e9a36df Merge f5f2203ed4 into 05fa625eac

05fa625eac convert : add JoyAI-LLM-Flash (#19651)

d612901116 perplexity: add proper batching (#19661)

cceb1b4e33 common : inline functions (#18639)

d23a55997d ggml : make `ggml_is_view` as API (#19539)

Compare 11 commits »

happyz synced commits to refs/pull/19478/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:28 -08:00

89175f6bf9 Merge 49a5ff40e2 into 05fa625eac

05fa625eac convert : add JoyAI-LLM-Flash (#19651)

d612901116 perplexity: add proper batching (#19661)

cceb1b4e33 common : inline functions (#18639)

d23a55997d ggml : make `ggml_is_view` as API (#19539)

Compare 11 commits »

happyz synced commits to refs/pull/19493/head at happyz/llama.cpp from mirror 2026-02-16 18:01:28 -08:00

0fa66c2774 chore: update webui build output

2f31b2da63 server : log levels

2ee85994e0 server : rename spec vars

c9fc6af71d server : fix draft check with checkpoints

07747a0836 server : speculative decoding using checkpoints

Compare 100 commits »

happyz synced commits to refs/pull/19361/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:27 -08:00

fa9b203ff2 Merge b7bab70acd into 05fa625eac

b7bab70acd server: to_json_oaicompat cached_tokens

f79c5472a9 tests : fix fetch_server_test_models.py

05fa625eac convert : add JoyAI-LLM-Flash (#19651)

d612901116 perplexity: add proper batching (#19661)

Compare 29 commits »

happyz synced commits to refs/pull/19378/head at happyz/llama.cpp from mirror 2026-02-16 18:01:27 -08:00

f0198ef6fc Merge pull request #6 from gaugarg-nv/get_host_buffer_type

aa8b62105c Support device-specific host buffer types if all underlying backends expose the same type. This allows using pinned memory instead of pageable memory for CUDA.

Compare 2 commits »

happyz synced commits to refs/pull/19378/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:27 -08:00

4df928973b Merge f0198ef6fc into 05fa625eac

05fa625eac convert : add JoyAI-LLM-Flash (#19651)

d612901116 perplexity: add proper batching (#19661)

cceb1b4e33 common : inline functions (#18639)

d23a55997d ggml : make `ggml_is_view` as API (#19539)

Compare 9 commits »

happyz synced commits to refs/pull/19409/merge at happyz/llama.cpp from mirror 2026-02-16 18:01:27 -08:00

3bafcefae9 Merge 1f42650078 into d612901116

d612901116 perplexity: add proper batching (#19661)

cceb1b4e33 common : inline functions (#18639)

d23a55997d ggml : make `ggml_is_view` as API (#19539)

5f28c53d11 model: Add support for Tiny Aya Models (#19611)

Compare 8 commits »