Julia Longtin
|
b00607d1ab
|
use vbroadcastss in place of vbroadcast32x4.
|
2024-05-10 15:52:35 +00:00 |
Julia Longtin
|
f6edcc4061
|
Use a vectorized assembly function to handle remaining chunks less than vector wide.
|
2024-05-10 14:52:46 +00:00 |
Julia Longtin
|
2282ac4d9f
|
broadcast a single int8, instead of 4 of them.
|
2024-05-10 14:19:27 +00:00 |
Julia Longtin
|
81ca166ecd
|
minor spacing and comment changes.
|
2024-05-09 16:57:59 +00:00 |
Julia Longtin
|
53773e0b4a
|
replace tabs with spaces.
|
2024-04-03 23:42:34 +00:00 |
Julia Longtin
|
9152143fe7
|
reformat, and label what these files are.
|
2024-04-03 23:21:24 +00:00 |
Julia Longtin
|
6f67ea886f
|
formatting changes.
|
2024-04-03 20:24:00 +00:00 |
Julia Longtin
|
bb5eb95816
|
use better memory save operator.
|
2024-03-23 20:49:11 +00:00 |
Julia Longtin
|
8f57803f58
|
import stdio.h for size_t.
|
2024-03-23 14:29:59 +00:00 |
Julia Longtin
|
9bcb8350d5
|
import stdint.h for sizeSt.
|
2024-03-23 14:28:29 +00:00 |
Julia Longtin
|
ac3637142d
|
formatting changes.
|
2024-03-20 21:34:12 +00:00 |
Julia Longtin
|
ee27148629
|
remove intrinsics import, and use upConv to save 12 bytes of memory transit.
|
2024-03-20 20:15:30 +00:00 |
Julia Longtin
|
ab6f3a8a8d
|
Update ggml-phi-knc.c
|
2024-03-17 21:36:14 +00:00 |
Julia Longtin
|
fe663c1b63
|
merge from upstream
|
2024-03-17 21:15:32 +00:00 |
Julia Longtin
|
717e164dd7
|
implement F32 dot products.
|
2024-03-16 14:05:03 +00:00 |