zhanmyz
19ec9b6bf5
Try adding the VIEW node to the OV Frontend; some issues still need to be dealt with
2026-01-15 10:05:41 -08:00
zhanmyz
b14b49d5f6
Minor Update
2026-01-15 10:05:41 -08:00
zhanmyz
467a5ddf04
1. Update the implementation of CPY node when it's non-contiguous
2. Remove duplicate get node operation function
2026-01-15 10:05:41 -08:00
zhanmyz
cff473a9e2
1. All operators implemented using OpenVINO can be successfully executed individually.
2. The VIEW op output tensor shape is not the same as the CONT (non-contiguous) input tensor shape
3. CPY (non-contiguous) can't be implemented with the original input/output tensor shape and data (the original shape needs to be changed when creating the input/output tensors)
Currently, the VIEW op is executed in the ggml backend and the others are executed in the OpenVINO Frontend.
2026-01-15 10:05:41 -08:00
zhanmyz
e08a7fda33
All adjacent ops can be converted, but the calculation result is wrong and needs debugging
2026-01-15 10:05:41 -08:00
zhanmyz
d05c458421
Change CONT and MUL_MAT input node shape
2026-01-15 10:05:41 -08:00
zhanmyz
246a2d1021
Change the input and output node shape of MUL_MAT operator
2026-01-15 10:05:41 -08:00
zhanmyz
f37fa21a5c
Change the input and output node shape of MUL_MAT operator
2026-01-15 10:05:41 -08:00
zhanmyz
f98d215162
Change the input parameter shape of CONT operator
2026-01-15 10:05:41 -08:00
zhanmyz
9a7b7d8d6d
OV Frontend supports GET_ROWS/RMS_NORM/MUL/MUL_MAT/ROPE/SCALE/SOFTMAX/ADD adjacent op graph conversion
2026-01-15 10:05:41 -08:00
zhanmyz
95ae982d59
OV Frontend supports GET_ROWS/RMS_NORM/MUL/MUL_MAT graph conversion of consecutive OPs
2026-01-15 10:05:41 -08:00
zhanmyz
901f7347ff
Executing CONT & VIEW operators in the OV Frontend is OK
2026-01-15 10:05:41 -08:00
zhanmyz
081b52667b
Executing a single CONT operator is OK
2026-01-15 10:05:41 -08:00
zhanmyz
afb8594194
add tmp source code files
2026-01-15 10:05:41 -08:00
zhanmyz
57582fda39
add implementation of CPY when the output tensor is non-contiguous
2026-01-15 10:05:41 -08:00
zhanmyz
8484769981
add implementation of MUL_MAT, CPY, CONT of GGML ops using OV ops
2026-01-15 10:05:41 -08:00
zhanmyz
cb2729bc4a
Move CPY from GGML OV Backend to OV Frontend
2026-01-15 10:05:41 -08:00
zhanmyz
2b04bd43be
Add MUL_MAT,CPY,CONT as operators implemented in OpenVINO for GGML backend
2026-01-15 10:05:41 -08:00
zhanmyz
0f7d07de7d
Add support for RMS_NORM OP
2026-01-15 10:05:41 -08:00
yumengbo
8aba03bac6
Support Softmax op
2026-01-15 10:05:41 -08:00
yumengbo
d218c61e6d
Support Softmax op
2026-01-15 10:05:41 -08:00
yumengbo
590f587b27
Add support for UNARY SILU op. Fix PyTorch impl bugs.
2026-01-15 10:05:41 -08:00
zhanmyz
8c5a609f8d
add the rms_norm operator implemented using OpenVINO to the GGML backend of llama.cpp
2026-01-15 10:05:41 -08:00
zhanmyz
80c330a469
Update build.md and add operation mapping(GGML to OpenVINO)
2026-01-15 10:05:41 -08:00
zhanmyz
49804f43fc
add GET_ROWS operator of OpenVINO to GGML of llama.cpp
2026-01-15 10:05:41 -08:00
yumengbo
9b7b63d12c
Convert subgraphs with add, sub, mul, div ops to an OV model and run inference on an OpenVINO device
2026-01-15 10:05:41 -08:00
yumengbo
171c4681f4
Add PoC of integration of openvino frontend. Main changes: ggml-ov-frontend-utils, GraphIterator, Decoder
2026-01-15 10:05:41 -08:00
zhanmyz
ee31dc1c1b
add get openvino available ops function
2026-01-15 10:05:41 -08:00
zhanmyz
77d68146a8
add OpenVINO frontend convert process steps
2026-01-15 10:05:41 -08:00
zhanmyz
0a81aa19f7
Add compile options
2026-01-15 10:05:40 -08:00
zhanmyz
adc2c70f44
Add OpenVINO MUL operator to GGML of Llama.cpp.
2026-01-15 10:05:40 -08:00
zhanmyz
faa4a7de76
Solve the issue of abnormal model output caused by using OpenVINO ADD operator
2026-01-15 10:05:40 -08:00
zhanmyz
9b9d51dddf
* Configure the device (default CPU) that OpenVINO uses to compile the model
* Add OpenVINO ADD operator to Llama.cpp. The output is somewhat abnormal and needs further debugging.
2026-01-15 10:05:40 -08:00
zhanmyz
5294402b50
add openvino as optional backend for Llama.cpp ggml
2026-01-15 10:05:40 -08:00
Yanglei Zou
fe5720e684
Add ggml-openvino base files
2026-01-15 10:05:40 -08:00