HappyZ happyz
happyz synced commits to refs/pull/7058/merge at happyz/llama.cpp from mirror 2024-05-08 18:38:48 -07:00
63a0cc2959 Merge c9c89524ad864d4e9ebb5f784398ca336c7fc36b into 4426e2987b
4426e2987b cmake : fix typo (#7151)
f98eb31c51 convert-hf : save memory with lazy evaluation (#7075)
bc4bba364f Introduction of CUDA Graphs to LLama.cpp (#6766)
c12452c7ae JSON: [key] -> .at(key), assert() -> GGML_ASSERT (#7143)
Compare 10 commits »
happyz synced commits to refs/pull/7020/merge at happyz/llama.cpp from mirror 2024-05-08 18:38:48 -07:00
f98eb31c51 convert-hf : save memory with lazy evaluation (#7075)
bc4bba364f Introduction of CUDA Graphs to LLama.cpp (#6766)
c12452c7ae JSON: [key] -> .at(key), assert() -> GGML_ASSERT (#7143)
9da243b36a Revert "llava : add support for moondream vision language model (#6899)"
Compare 9 commits »
happyz synced commits to refs/pull/7061/merge at happyz/llama.cpp from mirror 2024-05-08 18:38:48 -07:00
13708b4452 Merge 78ee06e5d97b750f52aaf5a6f0c229e391d153c1 into 4426e2987b
4426e2987b cmake : fix typo (#7151)
f98eb31c51 convert-hf : save memory with lazy evaluation (#7075)
bc4bba364f Introduction of CUDA Graphs to LLama.cpp (#6766)
c12452c7ae JSON: [key] -> .at(key), assert() -> GGML_ASSERT (#7143)
Compare 10 commits »
happyz synced commits to refs/pull/7083/merge at happyz/llama.cpp from mirror 2024-05-08 18:38:48 -07:00
13d72b0e01 Merge 19d51bae2cd2ae85e559c60eee943d95dfc2eb42 into f98eb31c51
f98eb31c51 convert-hf : save memory with lazy evaluation (#7075)
bc4bba364f Introduction of CUDA Graphs to LLama.cpp (#6766)
c12452c7ae JSON: [key] -> .at(key), assert() -> GGML_ASSERT (#7143)
9da243b36a Revert "llava : add support for moondream vision language model (#6899)"
Compare 9 commits »
happyz synced commits to refs/pull/6940/merge at happyz/llama.cpp from mirror 2024-05-08 18:38:47 -07:00
4426e2987b cmake : fix typo (#7151)
f98eb31c51 convert-hf : save memory with lazy evaluation (#7075)
bc4bba364f Introduction of CUDA Graphs to LLama.cpp (#6766)
c12452c7ae JSON: [key] -> .at(key), assert() -> GGML_ASSERT (#7143)
Compare 19 commits »
happyz synced commits to refs/pull/6951/merge at happyz/llama.cpp from mirror 2024-05-08 18:38:47 -07:00
bc4bba364f Introduction of CUDA Graphs to LLama.cpp (#6766)
c12452c7ae JSON: [key] -> .at(key), assert() -> GGML_ASSERT (#7143)
9da243b36a Revert "llava : add support for moondream vision language model (#6899)"
bd1871fa2b server : add themes + favicon (#6848)
Compare 8 commits »
happyz synced commits to refs/pull/6958/merge at happyz/llama.cpp from mirror 2024-05-08 18:38:47 -07:00
83330d8cd6 main : add --conversation / -cnv flag (#7108)
465263d0cf sgemm : AVX Q4_0 and Q8_0 (#6891)
Compare 3 commits »
happyz synced commits to refs/pull/6965/head at happyz/llama.cpp from mirror 2024-05-08 18:38:47 -07:00
ea47119736 Merge branch 'master' into gg/bpe-preprocess
77cbb79532 Refactor random tokenizer test
70ca1fe204 Clean gen-unicode-data.py
bc4bba364f Introduction of CUDA Graphs to LLama.cpp (#6766)
c12452c7ae JSON: [key] -> .at(key), assert() -> GGML_ASSERT (#7143)
Compare 39 commits »
happyz synced commits to refs/pull/6965/merge at happyz/llama.cpp from mirror 2024-05-08 18:38:47 -07:00
4426e2987b cmake : fix typo (#7151)
f98eb31c51 convert-hf : save memory with lazy evaluation (#7075)
ea47119736 Merge branch 'master' into gg/bpe-preprocess
77cbb79532 Refactor random tokenizer test
Compare 42 commits »
happyz synced commits to refs/pull/6999/merge at happyz/llama.cpp from mirror 2024-05-08 18:38:47 -07:00
5542cec719 Merge 9a8db6b2b4b54d1da178fd7c961839276fd98e52 into 83330d8cd6
83330d8cd6 main : add --conversation / -cnv flag (#7108)
465263d0cf sgemm : AVX Q4_0 and Q8_0 (#6891)
Compare 3 commits »
happyz synced commits to refs/pull/6919/merge at happyz/llama.cpp from mirror 2024-05-08 18:38:46 -07:00
4426e2987b cmake : fix typo (#7151)
f98eb31c51 convert-hf : save memory with lazy evaluation (#7075)
bc4bba364f Introduction of CUDA Graphs to LLama.cpp (#6766)
c12452c7ae JSON: [key] -> .at(key), assert() -> GGML_ASSERT (#7143)
Compare 10 commits »
happyz synced commits to refs/pull/6915/merge at happyz/llama.cpp from mirror 2024-05-08 18:38:46 -07:00
f3fac588d7 Merge e2dcf468dc40866ff4468bdc0b41d61bbaf5caec into 4426e2987b
4426e2987b cmake : fix typo (#7151)
f98eb31c51 convert-hf : save memory with lazy evaluation (#7075)
bc4bba364f Introduction of CUDA Graphs to LLama.cpp (#6766)
c12452c7ae JSON: [key] -> .at(key), assert() -> GGML_ASSERT (#7143)
Compare 10 commits »
happyz synced commits to refs/pull/6888/merge at happyz/llama.cpp from mirror 2024-05-08 18:38:46 -07:00
83330d8cd6 main : add --conversation / -cnv flag (#7108)
465263d0cf sgemm : AVX Q4_0 and Q8_0 (#6891)
911b3900dd server : add_special option for tokenize endpoint (#7059)
ad211edef5 convert.py : --vocab-only generates false but valid params (#7027)
Compare 29 commits »
happyz synced commits to refs/pull/6869/merge at happyz/llama.cpp from mirror 2024-05-08 18:38:46 -07:00
782d87703e Merge cf9dca34cbd02c89fb3f3c46e85da817fe89744b into f98eb31c51
f98eb31c51 convert-hf : save memory with lazy evaluation (#7075)
bc4bba364f Introduction of CUDA Graphs to LLama.cpp (#6766)
c12452c7ae JSON: [key] -> .at(key), assert() -> GGML_ASSERT (#7143)
9da243b36a Revert "llava : add support for moondream vision language model (#6899)"
Compare 14 commits »
happyz synced commits to refs/pull/6829/merge at happyz/llama.cpp from mirror 2024-05-08 18:38:45 -07:00
b0aea23612 Merge de1cf88601a527d6869696ff3f2c1cefb30b2f42 into 4426e2987b
4426e2987b cmake : fix typo (#7151)
f98eb31c51 convert-hf : save memory with lazy evaluation (#7075)
bc4bba364f Introduction of CUDA Graphs to LLama.cpp (#6766)
c12452c7ae JSON: [key] -> .at(key), assert() -> GGML_ASSERT (#7143)
Compare 10 commits »
happyz synced commits to refs/pull/6834/head at happyz/llama.cpp from mirror 2024-05-08 18:38:45 -07:00
8fe8231313 ChatON:SubPartsAwareTokenizePath: Allow extract subparts testing
a49697b488 ChatON: Keep compiler happy simbly
0d81ffe6eb Tests:ChatON: Add partial skeleton wrt subparts tokenizing
Compare 3 commits »
happyz synced commits to refs/pull/6839/merge at happyz/llama.cpp from mirror 2024-05-08 18:38:45 -07:00
4426e2987b cmake : fix typo (#7151)
f98eb31c51 convert-hf : save memory with lazy evaluation (#7075)
bc4bba364f Introduction of CUDA Graphs to LLama.cpp (#6766)
c12452c7ae JSON: [key] -> .at(key), assert() -> GGML_ASSERT (#7143)
Compare 10 commits »
happyz synced commits to refs/pull/6834/merge at happyz/llama.cpp from mirror 2024-05-08 18:38:45 -07:00
8fe8231313 ChatON:SubPartsAwareTokenizePath: Allow extract subparts testing
a49697b488 ChatON: Keep compiler happy simbly
0d81ffe6eb Tests:ChatON: Add partial skeleton wrt subparts tokenizing
Compare 4 commits »
happyz synced commits to refs/pull/6826/head at happyz/llama.cpp from mirror 2024-05-08 18:38:44 -07:00
8e36fd5a70 Merge branch 'master' of https://github.com/JoanFM/llama.cpp into feat-jina-embeddings
b7ede48294 llama : fix pre-tokenizers
83330d8cd6 main : add --conversation / -cnv flag (#7108)
465263d0cf sgemm : AVX Q4_0 and Q8_0 (#6891)
e59b54657a Merge branch 'master' into feat-jina-embeddings
Compare 26 commits »
happyz synced commits to refs/pull/6826/merge at happyz/llama.cpp from mirror 2024-05-08 18:38:44 -07:00
bc4bba364f Introduction of CUDA Graphs to LLama.cpp (#6766)
c12452c7ae JSON: [key] -> .at(key), assert() -> GGML_ASSERT (#7143)
9da243b36a Revert "llava : add support for moondream vision language model (#6899)"
bd1871fa2b server : add themes + favicon (#6848)
Compare 12 commits »