Julia Longtin
|
ca0dc26704
|
loosen alignment requirements for zeros, add missing function, and promote aux8 to an array of vectors.
|
2024-03-24 13:35:05 +00:00 |
Julia Longtin
|
cf481cf901
|
promote aux8 into a vector.
|
2024-03-24 12:50:01 +00:00 |
Julia Longtin
|
169a145409
|
fix our reference to src in the second place, and use a more accurate comment.
|
2024-03-24 12:41:21 +00:00 |
Julia Longtin
|
c28bfe4552
|
spacing changes, eliminate dead references to k1 or zero, and use the right type when referring to src.
|
2024-03-24 12:37:47 +00:00 |
Julia Longtin
|
ba4f4129b3
|
better comments, and fix some small errors.
|
2024-03-24 12:17:06 +00:00 |
Julia Longtin
|
03a3e0eb7a
|
perform 16 operations at a time.
|
2024-03-24 12:04:44 +00:00 |
Julia Longtin
|
5935bb34f4
|
use proper mov operator, and pass addresses.
|
2024-03-23 23:46:36 +00:00 |
Julia Longtin
|
a5132a1507
|
attempt our first FMA.
|
2024-03-23 22:16:57 +00:00 |
Julia Longtin
|
4477b8e123
|
add I32 vector memory clearing.
|
2024-03-23 21:16:23 +00:00 |
Julia Longtin
|
ea1edb0600
|
promote aux32 to a vector.
|
2024-03-23 21:12:35 +00:00 |
Julia Longtin
|
f967690a41
|
add missing address-of operators.
|
2024-03-23 21:05:50 +00:00 |
Julia Longtin
|
2fdd11fe3a
|
promote aux16 to a vector.
|
2024-03-23 21:00:51 +00:00 |
Julia Longtin
|
f09b3ed79e
|
use quotes properly.
|
2024-03-23 20:53:16 +00:00 |
Julia Longtin
|
9d7ca41703
|
expand mask, and align memory.
|
2024-03-23 20:48:43 +00:00 |
Julia Longtin
|
bd6d7e6238
|
try to use vectorized zeroing function.
|
2024-03-23 19:55:12 +00:00 |
Julia Longtin
|
f985372e3a
|
add missing variable.
|
2024-03-23 19:49:16 +00:00 |
Julia Longtin
|
31d4f9312b
|
copy right block.
|
2024-03-23 19:47:21 +00:00 |
Julia Longtin
|
f092a10dc9
|
promote aux16 into a vector. (part three)
|
2024-03-23 16:27:11 +00:00 |
Julia Longtin
|
c72157a5a6
|
promote aux16 into a vector.
|
2024-03-23 16:24:11 +00:00 |
Julia Longtin
|
e3503c924a
|
promote aux16 into a vector.
|
2024-03-23 16:21:20 +00:00 |
Julia Longtin
|
6face8a0be
|
first fixes.
|
2024-03-23 15:56:47 +00:00 |
Julia Longtin
|
0a2051aa88
|
attempt to speed up float clearing.
|
2024-03-23 15:55:00 +00:00 |
Julia Longtin
|
0b3f17127f
|
force to compile.
|
2024-03-23 14:58:33 +00:00 |
Julia Longtin
|
18f353987c
|
tell ggml-common.h to export what we want.
|
2024-03-23 14:49:35 +00:00 |
Julia Longtin
|
cd20404250
|
pull in ggml specific types.
|
2024-03-23 14:38:15 +00:00 |
Julia Longtin
|
8f57803f58
|
import stdio.h for size_t.
|
2024-03-23 14:29:59 +00:00 |
Julia Longtin
|
9bcb8350d5
|
import stdint.h for size_t.
|
2024-03-23 14:28:29 +00:00 |
Julia Longtin
|
a7bd64c130
|
begin work on targeting dot_q5_K_q8_K.
|
2024-03-23 14:19:47 +00:00 |