HappyZ happyz
happyz synced commits to refs/pull/6408/merge at happyz/llama.cpp from mirror 2024-04-26 11:13:57 -07:00
380d6ea0f9 Merge de8851868dd27651c941b8534ff32f2a612b4905 into bbe3c6e761
bbe3c6e761 ci: server: fix python installation (#6925)
7f5ff558ee server: stop generation at `n_ctx_train` if `n_predict` is not set (#6638)
9e4e077ec5 ci: server: fix python installation (#6922)
83b72cb086 Merge pull request from GHSA-p5mv-gjc5-mwqv
Compare 8 commits »
happyz synced commits to refs/pull/6287/merge at happyz/llama.cpp from mirror 2024-04-26 11:13:56 -07:00
83b72cb086 Merge pull request from GHSA-p5mv-gjc5-mwqv
d4a9afc100 ci: server: fix python installation (#6918)
7d641c26ac ci: fix concurrency for pull_request_target (#6917)
5790c8dac1 bench: server add stop word for PHI-2 (#6916)
Compare 5 commits »
happyz synced commits to refs/pull/6312/merge at happyz/llama.cpp from mirror 2024-04-26 11:13:56 -07:00
83b72cb086 Merge pull request from GHSA-p5mv-gjc5-mwqv
d4a9afc100 ci: server: fix python installation (#6918)
7d641c26ac ci: fix concurrency for pull_request_target (#6917)
5790c8dac1 bench: server add stop word for PHI-2 (#6916)
Compare 14 commits »
happyz synced commits to refs/pull/6389/merge at happyz/llama.cpp from mirror 2024-04-26 11:13:56 -07:00
da869b9c0b Merge 9126de013a4d8cabde26b4d03267b49f5819c3ce into bbe3c6e761
bbe3c6e761 ci: server: fix python installation (#6925)
7f5ff558ee server: stop generation at `n_ctx_train` if `n_predict` is not set (#6638)
9e4e077ec5 ci: server: fix python installation (#6922)
83b72cb086 Merge pull request from GHSA-p5mv-gjc5-mwqv
Compare 20 commits »
happyz synced commits to refs/pull/6035/head at happyz/llama.cpp from mirror 2024-04-26 11:13:56 -07:00
94610511da add q8_t transform
0c159d8c9e transform quant tensor format to speed up Ascend backend
Compare 2 commits »
happyz synced commits to refs/pull/5021/merge at happyz/llama.cpp from mirror 2024-04-26 11:13:56 -07:00
928e0b7013 Reset schedule earlier to allow overlap with ggml graph computation on device (#6933)
0c4d489e29 quantize: add imatrix and dataset metadata in GGUF (#6658)
017e6999b5 add basic tensor data validation function (#6884)
e2764cd7ca gguf : fix mismatch between alloc and free functions (#6929)
Compare 13 commits »
happyz synced commits to refs/pull/6035/merge at happyz/llama.cpp from mirror 2024-04-26 11:13:56 -07:00
6afaca7fd9 Merge 94610511daf7d2ccac8c5ff047da43e5a99cad77 into bbe3c6e761
bbe3c6e761 ci: server: fix python installation (#6925)
7f5ff558ee server: stop generation at `n_ctx_train` if `n_predict` is not set (#6638)
9e4e077ec5 ci: server: fix python installation (#6922)
94610511da add q8_t transform
Compare 10 commits »
happyz synced commits to gg/bpe-preprocess at happyz/llama.cpp from mirror 2024-04-26 11:13:55 -07:00
happyz synced and deleted reference refs/tags/refs/pull/6884/merge at happyz/llama.cpp from mirror 2024-04-26 11:13:55 -07:00
happyz synced and deleted reference refs/tags/refs/pull/6658/merge at happyz/llama.cpp from mirror 2024-04-26 11:13:55 -07:00
happyz synced commits to master at happyz/llama.cpp from mirror 2024-04-26 11:13:55 -07:00
928e0b7013 Reset schedule earlier to allow overlap with ggml graph computation on device (#6933)
0c4d489e29 quantize: add imatrix and dataset metadata in GGUF (#6658)
017e6999b5 add basic tensor data validation function (#6884)
e2764cd7ca gguf : fix mismatch between alloc and free functions (#6929)
4b1c3c98b4 llamafile : use 64-bit integers in sgemm (#6928)
Compare 12 commits »
happyz synced new reference gg/bpe-preprocess to happyz/llama.cpp from mirror 2024-04-26 11:13:55 -07:00
happyz synced and deleted reference refs/tags/hp/server/avoid-infinite-loop at happyz/llama.cpp from mirror 2024-04-26 11:13:54 -07:00
happyz synced and deleted reference refs/tags/hp/quantize/imatrix-metadata at happyz/llama.cpp from mirror 2024-04-26 11:13:54 -07:00
happyz synced and deleted reference refs/tags/refs/pull/6638/merge at happyz/llama.cpp from mirror 2024-04-26 11:13:54 -07:00
happyz synced and deleted reference refs/tags/sl/check-tensor at happyz/llama.cpp from mirror 2024-04-26 11:13:54 -07:00
happyz synced commits to refs/pull/4573/merge at happyz/fastapi from mirror 2024-04-26 11:13:41 -07:00
b254688f37 📝 Update release notes
026af6e248 🌐 Update Chinese translation for `docs/zh/docs/fastapi-people.md` (#11476)
Compare 3 commits »
happyz synced commits to refs/pull/1945/merge at happyz/fastapi from mirror 2024-04-26 11:13:41 -07:00
b254688f37 📝 Update release notes
026af6e248 🌐 Update Chinese translation for `docs/zh/docs/fastapi-people.md` (#11476)
38929aae1b 📝 Update release notes
550092a3bd ✏️ Fix typo in `fastapi/security/api_key.py` (#11481)
Compare 32 commits »
happyz synced commits to refs/pull/11439/merge at happyz/fastapi from mirror 2024-04-26 11:13:41 -07:00
b254688f37 📝 Update release notes
026af6e248 🌐 Update Chinese translation for `docs/zh/docs/fastapi-people.md` (#11476)
Compare 3 commits »
happyz synced commits to refs/pull/11253/merge at happyz/fastapi from mirror 2024-04-26 11:13:41 -07:00
b254688f37 📝 Update release notes
026af6e248 🌐 Update Chinese translation for `docs/zh/docs/fastapi-people.md` (#11476)
Compare 3 commits »