HappyZ happyz
happyz synced commits to refs/pull/7269/merge at happyz/llama.cpp from mirror 2024-05-20 23:06:32 -07:00
917dc8cfa6 Tokenizer SPM fixes for phi-3 and llama-spm (#7375)
fabf30b4c4 llama : remove Persimmon (#7408)
20385cebcc perplexity: update README FP16 results [no ci] (#7413)
db10f01310 rpc : track allocated buffers (#7411)
Compare 13 commits »
happyz synced commits to refs/pull/7270/merge at happyz/llama.cpp from mirror 2024-05-20 23:06:32 -07:00
917dc8cfa6 Tokenizer SPM fixes for phi-3 and llama-spm (#7375)
fabf30b4c4 llama : remove Persimmon (#7408)
20385cebcc perplexity: update README FP16 results [no ci] (#7413)
db10f01310 rpc : track allocated buffers (#7411)
Compare 13 commits »
happyz synced commits to refs/pull/7285/merge at happyz/llama.cpp from mirror 2024-05-20 23:06:32 -07:00
213e90ed73 ggml-opencl, llama: using reserve() if count already known (#7272)
65c58207ec ggml : add loongarch lsx and lasx support (#6454)
1cc0155d04 server : tuning tests (#7388)
e932094d58 server : return error on too large embedding input (#7389)
Compare 7 commits »
happyz synced commits to refs/pull/7232/merge at happyz/llama.cpp from mirror 2024-05-20 23:06:31 -07:00
917dc8cfa6 Tokenizer SPM fixes for phi-3 and llama-spm (#7375)
fabf30b4c4 llama : remove Persimmon (#7408)
20385cebcc perplexity: update README FP16 results [no ci] (#7413)
db10f01310 rpc : track allocated buffers (#7411)
Compare 22 commits »
happyz synced commits to refs/pull/7246/head at happyz/llama.cpp from mirror 2024-05-20 23:06:31 -07:00
8b67acc8d0 Merge branch 'ggerganov:master' into gguf-model-template
917dc8cfa6 Tokenizer SPM fixes for phi-3 and llama-spm (#7375)
fabf30b4c4 llama : remove Persimmon (#7408)
20385cebcc perplexity: update README FP16 results [no ci] (#7413)
db10f01310 rpc : track allocated buffers (#7411)
Compare 92 commits »
happyz synced commits to refs/pull/7246/merge at happyz/llama.cpp from mirror 2024-05-20 23:06:31 -07:00
8b67acc8d0 Merge branch 'ggerganov:master' into gguf-model-template
917dc8cfa6 Tokenizer SPM fixes for phi-3 and llama-spm (#7375)
fabf30b4c4 llama : remove Persimmon (#7408)
20385cebcc perplexity: update README FP16 results [no ci] (#7413)
Compare 14 commits »
happyz synced commits to refs/pull/7267/merge at happyz/llama.cpp from mirror 2024-05-20 23:06:31 -07:00
917dc8cfa6 Tokenizer SPM fixes for phi-3 and llama-spm (#7375)
fabf30b4c4 llama : remove Persimmon (#7408)
20385cebcc perplexity: update README FP16 results [no ci] (#7413)
db10f01310 rpc : track allocated buffers (#7411)
Compare 13 commits »
happyz synced commits to refs/pull/7225/merge at happyz/llama.cpp from mirror 2024-05-20 23:06:31 -07:00
095a574bbe Merge 5283ba3f09f9c9cb3f4457299cdf7266b97f2736 into 20385cebcc
20385cebcc perplexity: update README FP16 results [no ci] (#7413)
db10f01310 rpc : track allocated buffers (#7411)
3bc10cb485 server : fix temperature + disable some tests (#7409)
6bf9b66fa3 [SYCL] Update SYCL upscale operation (#7321)
Compare 11 commits »
happyz synced commits to refs/pull/6988/merge at happyz/llama.cpp from mirror 2024-05-20 23:06:30 -07:00
20385cebcc perplexity: update README FP16 results [no ci] (#7413)
db10f01310 rpc : track allocated buffers (#7411)
3bc10cb485 server : fix temperature + disable some tests (#7409)
6bf9b66fa3 [SYCL] Update SYCL upscale operation (#7321)
Compare 24 commits »
happyz synced commits to refs/pull/6999/merge at happyz/llama.cpp from mirror 2024-05-20 23:06:30 -07:00
3afec53d94 Merge ed1d3ffc2a97d4d7aff94e419b9701c14487f6c0 into 917dc8cfa6
917dc8cfa6 Tokenizer SPM fixes for phi-3 and llama-spm (#7375)
fabf30b4c4 llama : remove Persimmon (#7408)
20385cebcc perplexity: update README FP16 results [no ci] (#7413)
db10f01310 rpc : track allocated buffers (#7411)
Compare 13 commits »
happyz synced commits to refs/pull/7020/merge at happyz/llama.cpp from mirror 2024-05-20 23:06:30 -07:00
213e90ed73 ggml-opencl, llama: using reserve() if count already known (#7272)
65c58207ec ggml : add loongarch lsx and lasx support (#6454)
1cc0155d04 server : tuning tests (#7388)
e932094d58 server : return error on too large embedding input (#7389)
Compare 29 commits »
happyz synced commits to refs/pull/7058/merge at happyz/llama.cpp from mirror 2024-05-20 23:06:30 -07:00
1d482bb9fd Merge 080b549958d5ff9f57b9677974e49309c6dec9fa into 6bf9b66fa3
6bf9b66fa3 [SYCL] Update SYCL upscale operation (#7321)
26cd4237bc Update README.md (#7410)
213e90ed73 ggml-opencl, llama: using reserve() if count already known (#7272)
65c58207ec ggml : add loongarch lsx and lasx support (#6454)
Compare 37 commits »
happyz synced commits to refs/pull/6923/merge at happyz/llama.cpp from mirror 2024-05-20 23:06:30 -07:00
917dc8cfa6 Tokenizer SPM fixes for phi-3 and llama-spm (#7375)
fabf30b4c4 llama : remove Persimmon (#7408)
20385cebcc perplexity: update README FP16 results [no ci] (#7413)
db10f01310 rpc : track allocated buffers (#7411)
Compare 13 commits »
happyz synced commits to refs/pull/6840/merge at happyz/llama.cpp from mirror 2024-05-20 23:06:29 -07:00
1d02fa809a Merge bb3a5274c7c1efd883f7e57edb849c0394d2c91d into 917dc8cfa6
917dc8cfa6 Tokenizer SPM fixes for phi-3 and llama-spm (#7375)
fabf30b4c4 llama : remove Persimmon (#7408)
20385cebcc perplexity: update README FP16 results [no ci] (#7413)
db10f01310 rpc : track allocated buffers (#7411)
Compare 13 commits »
happyz synced commits to refs/pull/6888/merge at happyz/llama.cpp from mirror 2024-05-20 23:06:29 -07:00
213e90ed73 ggml-opencl, llama: using reserve() if count already known (#7272)
65c58207ec ggml : add loongarch lsx and lasx support (#6454)
1cc0155d04 server : tuning tests (#7388)
e932094d58 server : return error on too large embedding input (#7389)
Compare 7 commits »
happyz synced commits to refs/pull/6892/merge at happyz/llama.cpp from mirror 2024-05-20 23:06:29 -07:00
213e90ed73 ggml-opencl, llama: using reserve() if count already known (#7272)
65c58207ec ggml : add loongarch lsx and lasx support (#6454)
1cc0155d04 server : tuning tests (#7388)
e932094d58 server : return error on too large embedding input (#7389)
Compare 29 commits »
happyz synced commits to refs/pull/6919/merge at happyz/llama.cpp from mirror 2024-05-20 23:06:29 -07:00
917dc8cfa6 Tokenizer SPM fixes for phi-3 and llama-spm (#7375)
fabf30b4c4 llama : remove Persimmon (#7408)
20385cebcc perplexity: update README FP16 results [no ci] (#7413)
db10f01310 rpc : track allocated buffers (#7411)
Compare 14 commits »
happyz synced commits to refs/pull/6844/merge at happyz/llama.cpp from mirror 2024-05-20 23:06:29 -07:00
917dc8cfa6 Tokenizer SPM fixes for phi-3 and llama-spm (#7375)
fabf30b4c4 llama : remove Persimmon (#7408)
20385cebcc perplexity: update README FP16 results [no ci] (#7413)
db10f01310 rpc : track allocated buffers (#7411)
Compare 14 commits »
happyz synced commits to refs/pull/6834/merge at happyz/llama.cpp from mirror 2024-05-20 23:06:28 -07:00
917dc8cfa6 Tokenizer SPM fixes for phi-3 and llama-spm (#7375)
fabf30b4c4 llama : remove Persimmon (#7408)
20385cebcc perplexity: update README FP16 results [no ci] (#7413)
db10f01310 rpc : track allocated buffers (#7411)
Compare 14 commits »
happyz synced commits to refs/pull/6839/merge at happyz/llama.cpp from mirror 2024-05-20 23:06:28 -07:00
917dc8cfa6 Tokenizer SPM fixes for phi-3 and llama-spm (#7375)
fabf30b4c4 llama : remove Persimmon (#7408)
20385cebcc perplexity: update README FP16 results [no ci] (#7413)
db10f01310 rpc : track allocated buffers (#7411)
Compare 13 commits »