HappyZ happyz
happyz synced new reference refs/tags/b2885 to happyz/llama.cpp from mirror 2024-05-15 10:42:01 -07:00
happyz synced new reference refs/tags/b2884 to happyz/llama.cpp from mirror 2024-05-15 10:42:01 -07:00
happyz synced commits to refs/tags/b2885 at happyz/llama.cpp from mirror 2024-05-15 10:42:01 -07:00
happyz synced commits to refs/pull/7288/merge at happyz/llama.cpp from mirror 2024-05-15 10:42:00 -07:00
dc020985b8 Avoid unnecessarily disabling CUDA graphs (#7302)
344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)
9a17ab914b Add missing " (#7303)
ea3b0590ee embedding : free the batch after execution (#7297)
happyz synced commits to refs/pull/7286/merge at happyz/llama.cpp from mirror 2024-05-15 10:42:00 -07:00
dc020985b8 Avoid unnecessarily disabling CUDA graphs (#7302)
344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)
9a17ab914b Add missing " (#7303)
ea3b0590ee embedding : free the batch after execution (#7297)
happyz synced commits to refs/pull/7285/merge at happyz/llama.cpp from mirror 2024-05-15 10:42:00 -07:00
dc020985b8 Avoid unnecessarily disabling CUDA graphs (#7302)
344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)
9a17ab914b Add missing " (#7303)
ea3b0590ee embedding : free the batch after execution (#7297)
happyz synced commits to refs/pull/7288/head at happyz/llama.cpp from mirror 2024-05-15 10:42:00 -07:00
5c65037280 Update README.md
98a40c555c fix grammer lol.
happyz synced commits to refs/tags/b2884 at happyz/llama.cpp from mirror 2024-05-15 10:42:00 -07:00
happyz synced commits to refs/pull/7269/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:59 -07:00
ea3b0590ee embedding : free the batch after execution (#7297)
29499bb593 sync : ggml
48aa8fd1f2 ggml : add `ggml_upscale_ext` (ggml/814)
583fd6b000 server bench: fix bench not waiting for model load (#7284)
happyz synced commits to refs/pull/7270/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:59 -07:00
dc020985b8 Avoid unnecessarily disabling CUDA graphs (#7302)
344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)
9a17ab914b Add missing " (#7303)
ea3b0590ee embedding : free the batch after execution (#7297)
happyz synced commits to refs/pull/7272/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:59 -07:00
d89b3ce50e Merge f5aef4657e3b73d6855da22726cc06f68151de82 into 583fd6b000
583fd6b000 server bench: fix bench not waiting for model load (#7284)
happyz synced commits to refs/pull/7273/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:59 -07:00
583fd6b000 server bench: fix bench not waiting for model load (#7284)
happyz synced commits to refs/pull/7274/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:59 -07:00
583fd6b000 server bench: fix bench not waiting for model load (#7284)
happyz synced commits to refs/pull/7279/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:59 -07:00
dc020985b8 Avoid unnecessarily disabling CUDA graphs (#7302)
344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)
9a17ab914b Add missing " (#7303)
ea3b0590ee embedding : free the batch after execution (#7297)
happyz synced commits to refs/pull/7267/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:58 -07:00
dc020985b8 Avoid unnecessarily disabling CUDA graphs (#7302)
344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)
9a17ab914b Add missing " (#7303)
ea3b0590ee embedding : free the batch after execution (#7297)
happyz synced commits to refs/pull/7263/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:58 -07:00
dc020985b8 Avoid unnecessarily disabling CUDA graphs (#7302)
344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)
9a17ab914b Add missing " (#7303)
ea3b0590ee embedding : free the batch after execution (#7297)
happyz synced commits to refs/pull/7246/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:58 -07:00
2185e5cf14 docs: Update and fix CLI help descriptions
b4b6f1fa00 fix: End messages with a user role due to jinja2 conditional checks
dc020985b8 Avoid unnecessarily disabling CUDA graphs (#7302)
344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)
happyz synced commits to refs/pull/7258/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:58 -07:00
dc020985b8 Avoid unnecessarily disabling CUDA graphs (#7302)
344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)
9a17ab914b Add missing " (#7303)
ea3b0590ee embedding : free the batch after execution (#7297)
happyz synced commits to refs/pull/7246/head at happyz/llama.cpp from mirror 2024-05-15 10:41:58 -07:00
2185e5cf14 docs: Update and fix CLI help descriptions
b4b6f1fa00 fix: End messages with a user role due to jinja2 conditional checks
happyz synced commits to refs/pull/7245/merge at happyz/llama.cpp from mirror 2024-05-15 10:41:58 -07:00
aaceba9304 Merge 2642da0ca8883994d20a73bebcd80f6f59b06c69 into dc020985b8
dc020985b8 Avoid unnecessarily disabling CUDA graphs (#7302)
2642da0ca8 Get rid of BOM
344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)
fe2434e3e9 Minor + style