Default Branch

4fd59e8427 · ggml-cuda: use CMAKE_CUDA_ARCHITECTURES if set when GGML_NATIVE=ON (#18413) · Updated 2025-12-27 17:33:14 -08:00

Branches

0248ca811e · gguf : add notes for tests · Updated 2023-08-24 23:08:05 -07:00 · happyz · 6500 behind, 10 ahead
977629a34e · Merge branch 'master' into fix-eos · Updated 2023-08-23 12:40:19 -07:00 · happyz · 6516 behind, 4 ahead
66a66a05a8 · readme : add notice about new file format · Updated 2023-08-21 12:42:14 -07:00 · happyz · 6545 behind, 253 ahead
6a9e6375b5 · gguf.py : indentation · Updated 2023-08-17 11:53:15 -07:00 · happyz · 6560 behind, 205 ahead
28046d1e52 · Merge and update · Updated 2023-08-08 14:36:11 -07:00 · happyz · 6613 behind, 12 ahead
511055722e · undo formatting · Updated 2023-07-27 23:09:14 -07:00 · happyz · 6642 behind, 26 ahead
af1c9966c8 · gguf : start write tensor info · Updated 2023-07-27 00:32:31 -07:00 · happyz · 6642 behind, 15 ahead
d273bfd2c9 · allocator: cleanup, more comments · Updated 2023-07-22 06:05:24 -07:00 · happyz · 6713 behind, 21 ahead
d45c1631bc · metal : rewrite to fit new backend interface correctly (WIP) · Updated 2023-07-20 12:51:19 -07:00 · happyz · 6713 behind, 18 ahead
0492363137 · mpi : fix after master merge · Updated 2023-07-09 12:23:04 -07:00 · happyz · 6744 behind, 21 ahead
26cc1bd7a2 · llama : uniform variable names + struct init · Updated 2023-07-05 13:22:17 -07:00 · happyz · 6761 behind, 4 ahead
ff6e39f138 · use javascript generators as much cleaner API · Updated 2023-07-05 12:03:01 -07:00 · happyz · 6774 behind, 20 ahead
f46db27ea0 · ci : disable FMA on Mac OS · Updated 2023-07-05 08:29:08 -07:00 · happyz · 6771 behind, 5 ahead
5cc672a9a5 · metal : try to utilize more of the shared memory using smaller views · Updated 2023-06-26 12:23:04 -07:00 · happyz · 6808 behind, 1 ahead
78fafcaf10 · ggml : do not use _GNU_SOURCE gratuitously · Updated 2023-06-25 07:21:02 -07:00 · happyz · 6816 behind, 1 ahead
20054a38c1 · Fix directory name · Updated 2023-05-26 16:00:08 -07:00 · happyz · 6966 behind, 1 ahead
a1cdd29cd2 · ggml : rms_norm in chunks · Updated 2023-05-20 00:15:54 -07:00 · happyz · 6987 behind, 2 ahead
95dc4d7270 · Merge 'origin/master' into steering · Updated 2023-05-19 13:19:57 -07:00 · happyz · 6989 behind, 9 ahead
40ec4882c4 · ggml : use F16C conversion when available · Updated 2023-05-17 10:05:51 -07:00 · happyz · 6998 behind, 1 ahead
a3e6d62283 · cuda : alternative q4_q8 kernel · Updated 2023-05-12 07:02:39 -07:00 · happyz · 7032 behind, 8 ahead