Default Branch

07a0c4ba92 · Revert "ggml-cuda: use CMAKE_CUDA_ARCHITECTURES if set when GGML_NATIVE=ON (#18413)" (#18426) · Updated 2025-12-28 04:53:36 -08:00

Branches

6b2f496409 · wip · Updated 2024-05-17 04:01:09 -07:00

4658
1

a085a8323a · tmp · Updated 2024-05-16 04:45:47 -07:00

4660
1

5de9b743f8 · sched : support async weight copy · Updated 2024-05-15 15:47:40 -07:00

4667
1

284870c868 · Merge branch 'master' into fix-convert-modelname · Updated 2024-05-13 22:05:49 -07:00

4690
2

94061d58e7 · llama : disable pipeline parallelism with nkvo · Updated 2024-05-13 14:17:53 -07:00

4690
1

33a004e9cc · llama : more metal-friendly KV cache PAD · Updated 2024-05-13 00:32:19 -07:00

4696
1

65a1a58562 · convert-hf : add missing ftype to Baichuan and Xverse · Updated 2024-05-12 09:56:03 -07:00

4709
3

03e940cdec · convert : fix convert for refact models · Updated 2024-05-11 00:31:52 -07:00

4725
10

e0af2df690 · convert-hf : support outtype templating in outfile name · Updated 2024-05-10 10:42:03 -07:00

4738
7

fecb81e302 · metal : fix ggml_metal_supports_op · Updated 2024-05-09 04:01:03 -07:00

4737
2

494f70f939 · cmake : fix typo · Updated 2024-05-08 13:24:02 -07:00

4741
1

bffdaf4010 · Merge branch 'master' into compilade/lazy-convert-hf · Updated 2024-05-08 07:56:03 -07:00

4745
26

0fc560fe96 · ci : enable git lfs for build.yml · Updated 2024-05-08 00:53:02 -07:00

4754
7

c32d39cefb · Merge branch 'master' into compilade/convert-hf-refactor · Updated 2024-05-06 02:33:38 -07:00

4767
19

c240ae234c · ci : fix arg order · Updated 2024-04-30 01:43:36 -07:00

4797
145

5ddad95e5c · ci : tmp disable gguf-split · Updated 2024-04-29 08:29:38 -07:00

4798
1

80cb3127df · tests : disable test-tokenizer-1-bpe due to slowness · Updated 2024-04-29 05:24:39 -07:00

4810
61

8c259f6f3e · ggml : fix MIN / MAX macros · Updated 2024-04-25 04:28:41 -07:00

4835
1

5dcccb3a7d · convert : fix tokenizer conversion · Updated 2024-04-23 12:11:09 -07:00

4846
2

124e4dced2 · Update · Updated 2024-04-22 02:42:32 -07:00

4903
2