Branches - happyz/llama.cpp - HappyGit

master

4fd59e8427 · ggml-cuda: use CMAKE_CUDA_ARCHITECTURES if set when GGML_NATIVE=ON (#18413) · Updated 2025-12-27 17:33:14 -08:00

gg/metal-opt-mul-mat-id 9f51f3e695 · metal : opt mul_mm_id · Updated 2024-01-02 10:50:18 -08:00 happyz	5806 17		ZIP TAR.GZ
cuda-cublas-opts 4cc78d3873 · ggml : force F32 precision for ggml_mul_mat · Updated 2024-01-02 07:54:56 -08:00 happyz	5805 1		ZIP TAR.GZ
gg/avoid-mutex b5af7ad84f · llama : refactor quantization to avoid <mutex> header · Updated 2024-01-02 05:56:57 -08:00 happyz	5808 1		ZIP TAR.GZ
gg/hf-auto-dl 120a1a5515 · llama : auto download HF models if URL provided · Updated 2024-01-02 03:29:06 -08:00 happyz	5809 1		ZIP TAR.GZ
gg/gpu-prec-tests f64e4f04e7 · ggml : testing GPU FP precision via quantized CPY · Updated 2023-12-30 09:11:40 -08:00 happyz	5827 1		ZIP TAR.GZ
gg/test-arm f32f30bc57 · test · Updated 2023-12-26 07:52:42 -08:00 happyz	5857 1		ZIP TAR.GZ
gg/ggml_scale ab1b75166f · Merge branch 'master' into gg/ggml_scale · Updated 2023-12-21 12:35:11 -08:00 happyz	5880 4		ZIP TAR.GZ
ceb/fix-draft-model-default 7c87353e61 · common : remove incorrect --model-draft default · Updated 2023-12-21 09:17:12 -08:00 happyz	5888 1		ZIP TAR.GZ
gg/cublas-f32 a40f6110f0 · ggml : force F32 precision for ggml_mul_mat · Updated 2023-12-19 06:34:59 -08:00 happyz	5895 1		ZIP TAR.GZ
gg/plamo-test 3c734f4941 · plamo : testing · Updated 2023-12-18 07:06:05 -08:00 happyz	5900 13		ZIP TAR.GZ
gg/phi-2-2 a462159c43 · cuda : ggml_cuda_op_mul_mat_cublas support F32 precision · Updated 2023-12-18 04:24:29 -08:00 happyz	5900 16		ZIP TAR.GZ
ceb/fix-logit-check 1b05817112 · decode : fix logits_valid for old API · Updated 2023-12-17 15:49:21 -08:00 happyz	5901 1		ZIP TAR.GZ
gg/swiftui-bench 865066621b · llama.swiftui : improve bench · Updated 2023-12-17 09:37:22 -08:00 happyz	5915 12		ZIP TAR.GZ
pr/4484 f86b9d152c · lookup : minor · Updated 2023-12-17 07:25:28 -08:00 happyz	5913 9		ZIP TAR.GZ
gg/phi-2 d2f1e0dacc · Merge branch 'cuda-cublas-opts' into gg/phi-2 · Updated 2023-12-16 22:41:46 -08:00 happyz	5911 17		ZIP TAR.GZ
ceb/fix-badspecial-silentfail b0547d2196 · gguf-py : fail fast on nonsensical special token IDs · Updated 2023-12-15 15:06:42 -08:00 happyz	5913 1		ZIP TAR.GZ
ceb/fix-cuda-warning-flags c8554b80be · Merge branch 'master' of https://github.com/ggerganov/llama.cpp into ceb/fix-cuda-warning-flags · Updated 2023-12-13 09:06:01 -08:00 happyz	5925 12		ZIP TAR.GZ
mixtral e1241d9b46 · metal : switch to execution barriers + fix one of the barriers · Updated 2023-12-13 03:56:45 -08:00 happyz	5936 47		ZIP TAR.GZ
gg/per-layer-kv fc5f334689 · readme : add API change notice · Updated 2023-12-07 02:35:02 -08:00 happyz	5938 15		ZIP TAR.GZ
gg/quantum-k-cache af99c6fbfc · llama : remove memory_f16 and kv_f16 flags · Updated 2023-12-05 08:18:16 -08:00 happyz	5950 26		ZIP TAR.GZ

... 19 20 21 22 23 ...