Commit Graph

6 Commits

Author SHA1 Message Date
Julia Longtin 867de5edce use different restrict syntax, to make g++ happy. 2024-05-09 23:08:43 +00:00
Julia Longtin af4ee51fa7 add batch fp16<->fp32 conversion functions. 2024-05-09 19:31:28 +00:00
Julia Longtin 81ca166ecd minor spacing and comment changes. 2024-05-09 16:57:59 +00:00
Julia Longtin e298d9e65e further optimizations. 0.99 tokens per second. 2024-04-22 18:16:28 +00:00
Julia Longtin 96fdd214c8 indent headers consistently. 2024-04-03 19:01:18 +00:00
Julia Longtin a7bd64c130 begin work on targeting dot_q5_K_q8_K. 2024-03-23 14:19:47 +00:00