Default Branch

4fd59e8427 · ggml-cuda: use CMAKE_CUDA_ARCHITECTURES if set when GGML_NATIVE=ON (#18413) · Updated 2025-12-27 17:33:14 -08:00

Branches

acead654d2 · Merge branch 'master' into fix-refact · Updated 2023-10-08 01:25:16 -07:00    happyz

6207
4

6b9554a740 · metal : print more GPU info + disable mul_mm for MTLGPUFamiliy < Apple7 · Updated 2023-10-07 23:55:13 -07:00    happyz

6214
5

ba44776dc2 · bump version · Updated 2023-10-07 11:47:48 -07:00    happyz

6213
6

5ab6c2132a · server-parallel : add "--reverse-prompt" + compiler warning fixes · Updated 2023-10-06 04:32:19 -07:00    happyz

6226
4

5418932b71 · llama : fix comments for llama_kv_cache API · Updated 2023-10-03 11:01:52 -07:00    happyz

6251
5

c5650ed470 · server : avoid context swaps by shifting the KV cache · Updated 2023-09-28 09:03:36 -07:00    happyz

6275
57

72e7ef4e53 · simple : fixes · Updated 2023-09-26 14:19:36 -07:00    happyz

6301
48

784d14ed31 · llama : store non-RoPEd K cache (WIP) · Updated 2023-09-17 13:43:07 -07:00    happyz

6313
5

92a4f86879 · llama : make starcoder graph build more consistent with others · Updated 2023-09-15 07:57:10 -07:00    happyz

6323
20

e7e7b11455 · llama : remove experimental stuff · Updated 2023-09-14 12:52:01 -07:00    happyz

6335
3

2f689dee06 · metal : minor · Updated 2023-09-07 05:33:21 -07:00    happyz

6368
5

30ac7a4117 · gitignore : metal · Updated 2023-09-04 12:23:16 -07:00    happyz

6380
12

f3a84b2e0d · llama : better express the KV cache dependencies in the graph · Updated 2023-09-04 11:44:48 -07:00    happyz

6380
5

c79d130f74 · make : fix speculative build · Updated 2023-09-04 05:50:04 -07:00    happyz

6381
9

847896aba7 · speculative : add --draft CLI arg · Updated 2023-09-03 03:51:07 -07:00    happyz

6387
3

8c2b881281 · cuda : poc for norm quants (only -b 1 works) · Updated 2023-08-30 11:42:28 -07:00    happyz

6428
3

b4e70822f6 · metal : add poc for normalized Q4_0 and Q4_1 · Updated 2023-08-30 08:47:16 -07:00    happyz

6428
7

488e03200e · Merge branch 'master' into gguf-publish-ci · Updated 2023-08-30 01:34:55 -07:00    happyz

6433
4

33a5517d87 · llama.cpp : print gguf version · Updated 2023-08-26 14:56:48 -07:00    happyz

6475
10

d34472c124 · Fix HellaSwag · Updated 2023-08-26 00:55:39 -07:00    happyz

6488
1