HappyZ

happyz synced new reference refs/tags/b2885 to happyz/llama.cpp from mirror 2024-05-15 10:42:01 -07:00

happyz synced new reference refs/tags/b2884 to happyz/llama.cpp from mirror 2024-05-15 10:42:01 -07:00

happyz synced commits to refs/tags/b2885 at happyz/llama.cpp from mirror 2024-05-15 10:42:01 -07:00

happyz synced commits to refs/pull/7288/merge at happyz/llama.cpp from mirror 2024-05-15 10:42:00 -07:00

5df3fa96a5 Merge 5c65037280 into dc020985b8

dc020985b8 Avoid unnecessarily disabling CUDA graphs (#7302)

344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)

9a17ab914b Add missing " (#7303)

ea3b0590ee embedding : free the batch after execution (#7297)

Compare 10 commits »

happyz synced commits to refs/pull/7286/merge at happyz/llama.cpp from mirror 2024-05-15 10:42:00 -07:00

8c6172dc3b Merge 2992479a42 into dc020985b8

dc020985b8 Avoid unnecessarily disabling CUDA graphs (#7302)

344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)

9a17ab914b Add missing " (#7303)

ea3b0590ee embedding : free the batch after execution (#7297)

Compare 8 commits »

happyz synced commits to refs/pull/7285/merge at happyz/llama.cpp from mirror 2024-05-15 10:42:00 -07:00

b9b188d834 Merge 4d646f8f13 into dc020985b8

dc020985b8 Avoid unnecessarily disabling CUDA graphs (#7302)

344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)

9a17ab914b Add missing " (#7303)

ea3b0590ee embedding : free the batch after execution (#7297)

Compare 8 commits »

happyz synced commits to refs/pull/7288/head at happyz/llama.cpp from mirror 2024-05-15 10:42:00 -07:00

5c65037280 Update README.md

98a40c555c fix grammer lol.

Compare 2 commits »

happyz synced commits to refs/tags/b2884 at happyz/llama.cpp from mirror 2024-05-15 10:42:00 -07:00

happyz synced commits to refs/pull/7269/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:59 -07:00

10a9c969ea Merge ca61d3e498 into ea3b0590ee

ea3b0590ee embedding : free the batch after execution (#7297)

29499bb593 sync : ggml

48aa8fd1f2 ggml : add `ggml_upscale_ext` (ggml/814)

583fd6b000 server bench: fix bench not waiting for model load (#7284)

Compare 5 commits »

happyz synced commits to refs/pull/7270/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:59 -07:00

d08d4409ad Merge ced5bfeb33 into dc020985b8

dc020985b8 Avoid unnecessarily disabling CUDA graphs (#7302)

344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)

9a17ab914b Add missing " (#7303)

ea3b0590ee embedding : free the batch after execution (#7297)

Compare 8 commits »

happyz synced commits to refs/pull/7272/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:59 -07:00

d89b3ce50e Merge f5aef4657e3b73d6855da22726cc06f68151de82 into 583fd6b000

583fd6b000 server bench: fix bench not waiting for model load (#7284)

Compare 2 commits »

happyz synced commits to refs/pull/7273/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:59 -07:00

e3c889e812 Merge c08d69f924 into 583fd6b000

583fd6b000 server bench: fix bench not waiting for model load (#7284)

Compare 2 commits »

happyz synced commits to refs/pull/7274/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:59 -07:00

184783673e Merge a30c3ab02c into 583fd6b000

583fd6b000 server bench: fix bench not waiting for model load (#7284)

Compare 2 commits »

happyz synced commits to refs/pull/7279/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:59 -07:00

ebaff681fb Merge 2304113b1c into dc020985b8

dc020985b8 Avoid unnecessarily disabling CUDA graphs (#7302)

344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)

9a17ab914b Add missing " (#7303)

ea3b0590ee embedding : free the batch after execution (#7297)

Compare 8 commits »

happyz synced commits to refs/pull/7267/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:58 -07:00

91bee717aa Merge 2a9a84be7d into dc020985b8

dc020985b8 Avoid unnecessarily disabling CUDA graphs (#7302)

344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)

9a17ab914b Add missing " (#7303)

ea3b0590ee embedding : free the batch after execution (#7297)

Compare 8 commits »

happyz synced commits to refs/pull/7263/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:58 -07:00

972f7433e4 Merge 79b044b0c5 into dc020985b8

dc020985b8 Avoid unnecessarily disabling CUDA graphs (#7302)

344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)

9a17ab914b Add missing " (#7303)

ea3b0590ee embedding : free the batch after execution (#7297)

Compare 8 commits »

happyz synced commits to refs/pull/7246/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:58 -07:00

c6c7a289d4 Merge 2185e5cf14 into dc020985b8

2185e5cf14 docs: Update and fix CLI help descriptions

b4b6f1fa00 fix: End messages with a user role due to jinja2 conditional checks

dc020985b8 Avoid unnecessarily disabling CUDA graphs (#7302)

344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)

Compare 10 commits »

happyz synced commits to refs/pull/7258/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:58 -07:00

71b2928036 Merge 29d5012042 into dc020985b8

dc020985b8 Avoid unnecessarily disabling CUDA graphs (#7302)

344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)

9a17ab914b Add missing " (#7303)

ea3b0590ee embedding : free the batch after execution (#7297)

Compare 8 commits »

happyz synced commits to refs/pull/7246/head at happyz/llama.cpp from mirror 2024-05-15 10:41:58 -07:00

2185e5cf14 docs: Update and fix CLI help descriptions

b4b6f1fa00 fix: End messages with a user role due to jinja2 conditional checks

Compare 2 commits »

happyz synced commits to refs/pull/7245/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:58 -07:00

aaceba9304 Merge 2642da0ca8883994d20a73bebcd80f6f59b06c69 into dc020985b8

dc020985b8 Avoid unnecessarily disabling CUDA graphs (#7302)

2642da0ca8 Get rid of BOM

344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)

fe2434e3e9 Minor + style

Compare 10 commits »