Julia Longtin
|
867de5edce
|
use different restrict syntax, to make g++ happy.
|
2024-05-09 23:08:43 +00:00 |
Julia Longtin
|
af4ee51fa7
|
add batch fp16<->fp32 conversion functions.
|
2024-05-09 19:31:28 +00:00 |
Julia Longtin
|
81ca166ecd
|
minor spacing and comment changes.
|
2024-05-09 16:57:59 +00:00 |
Julia Longtin
|
e298d9e65e
|
further optimizations. 0.99 tokens per second.
|
2024-04-22 18:16:28 +00:00 |
Julia Longtin
|
96fdd214c8
|
indent headers consistently.
|
2024-04-03 19:01:18 +00:00 |
Julia Longtin
|
a7bd64c130
|
begin work on targeting dot_q5_K_q8_K.
|
2024-03-23 14:19:47 +00:00 |