Branches - happyz/llama.cpp - HappyGit

master

07a0c4ba92 · Revert "ggml-cuda: use CMAKE_CUDA_ARCHITECTURES if set when GGML_NATIVE=ON (#18413)" (#18426) · Updated 2025-12-28 04:53:36 -08:00

ik/fix_k_cache_backend_tests 68e4fed4d9 · Now fix test-quantize-fns · Updated 2024-03-21 04:18:03 -07:00 happyz	5081 3		ZIP TAR.GZ
compilade/fix-server-tests-penalty 9a424a3872 · server : fix tests expecting old repeat penalty · Updated 2024-03-19 14:12:28 -07:00 happyz	5097 1		ZIP TAR.GZ
gg/repeng 0a9bc301ac · control-vectors : minor code style updates · Updated 2024-03-14 07:43:37 -07:00 happyz	5136 3		ZIP TAR.GZ
gg/metal-embed abf0afd0d6 · ci : fix iOS builds to use embedded library · Updated 2024-03-14 02:34:22 -07:00 happyz	5154 4		ZIP TAR.GZ
ik/try_fix_iq1s_sycl 9f805264dc · Attempt 2 · Updated 2024-03-12 09:40:13 -07:00 happyz	5154 3		ZIP TAR.GZ
ik/even_better_iq1s 5440a127c7 · iq1_s: fix dequantize on the CPU · Updated 2024-03-11 06:17:28 -07:00 happyz	5167 6		ZIP TAR.GZ
gg/try-fix-sycl-iq1_s 76be02aebc · sycl : fix grid type · Updated 2024-03-11 06:17:08 -07:00 happyz	5162 3		ZIP TAR.GZ
sycl_q3s_q1s 989e15b3c1 · Merge branch 'master' into sycl_q3s_q1s · Updated 2024-03-10 20:11:35 -07:00 happyz	5169 9		ZIP TAR.GZ
gritlm-pr b54afce9f4 · mostly style fixes; fix KQ_mask comment · Updated 2024-03-09 11:03:46 -08:00 happyz	5212 10		ZIP TAR.GZ
gg/bert-f16 0ba20ed97a · llama : compute BERT graph with F16 K, V · Updated 2024-03-07 06:33:30 -08:00 happyz	5202 1		ZIP TAR.GZ
revert-5901-fix_set_gpu b5b0270372 · Revert "[SYCL] fix error when set main gpu to non-zero (#5901)" · Updated 2024-03-07 01:11:18 -08:00 happyz	5206 1		ZIP TAR.GZ
ik/iq3_s_multiplier 31cecc8734 · iq3_s_mult_shuffle: use lookup table on Metal · Updated 2024-03-05 00:19:44 -08:00 happyz	5280 24		ZIP TAR.GZ
gg/fix-embeddings-wip 4ec0e9abbf · wip · Updated 2024-03-04 07:07:12 -08:00 happyz	5229 5		ZIP TAR.GZ
ci/server/fix-slow-test eb0bf32caf · server: tests: schedule slow dispatch only on release or on demand · Updated 2024-03-02 14:18:31 -08:00 happyz	5241 1		ZIP TAR.GZ
ceb/convert-hf-refactor 0b673ca187 · s/_MODEL_CLASSES/_model_classes/ · Updated 2024-03-02 09:14:37 -08:00 happyz	5254 3		ZIP TAR.GZ
ik/iq3_s_faster d4dfc250cc · Fix ARM_NEON · Updated 2024-03-02 00:12:02 -08:00 happyz	5259 7		ZIP TAR.GZ
ceb/convert-vocab-fallback f8ab539190 · convert : update help string · Updated 2024-03-01 09:29:34 -08:00 happyz	5257 3		ZIP TAR.GZ
gg/fix-starcoder2 9862d59c05 · llama : change starcoder2 rope type · Updated 2024-03-01 05:10:31 -08:00 happyz	5266 8		ZIP TAR.GZ
ik/i-quants-64 f0cbb6ddf6 · iq1_s: turn off SIMD implementation for QK_K = 64 (it does not work) · Updated 2024-02-27 22:28:10 -08:00 happyz	5281 6		ZIP TAR.GZ
gg/kv-compress 14d757066b · llama : add llama_kv_cache_compress (EXPERIMENTAL) · Updated 2024-02-27 06:24:40 -08:00 happyz	5282 1		ZIP TAR.GZ

... 16 17 18 19 20 ...