llama.cpp/ggml/src/ggml-sycl
shani-f 72bd7321a7
sycl : unify unary kernels with a generic implementation and enable wide operator support (#17213)
* SYCL: add generic unary op implementation for multiple ops (ABS/SGN/…); unify non-contiguous access

* SYCL: update documentation and sycl.csv to reflect new unary op support

* update ops.md after syncing SYCL.csv changes

* Fix SYCL.csv merge conflict

* Update ops.md after fixing SYCL.csv conflicts

* Fix SYCL.csv tail after merge conflict and regenerate ops.md

* Fix line endings and final newline in SYCL.csv

* Remove TOPK_MOE entries from SYCL.csv as requested

* Update ops.md after removing TOPK_MOE from SYCL.csv

* Regenerated SYCL.csv and synced ops.md with upstream

* Update ops.md using create_ops_docs.py
2025-11-16 00:52:42 +01:00
..
dpct [SYCL] refactor soft_max, add soft_max_back (#16472) 2025-10-09 10:25:11 +03:00
CMakeLists.txt
backend.hpp sycl: add SSM_CONV operation support (#16800) 2025-10-28 09:50:33 +08:00
binbcast.cpp [SYCL] fix UT fault cases: count-equal, argsort, pad OPs (#16521) 2025-10-12 21:53:35 +08:00
binbcast.hpp [SYCL] fix UT fault cases: count-equal, argsort, pad OPs (#16521) 2025-10-12 21:53:35 +08:00
common.cpp
common.hpp [SYCL] fix UT fault cases: count-equal, argsort, pad OPs (#16521) 2025-10-12 21:53:35 +08:00
concat.cpp sycl: add CONCAT operator support (#16047) 2025-11-06 11:02:33 +01:00
concat.hpp
conv.cpp
conv.hpp
convert.cpp
convert.hpp
count-equal.cpp [SYCL] fix UT fault cases: count-equal, argsort, pad OPs (#16521) 2025-10-12 21:53:35 +08:00
count-equal.hpp [SYCL] fix UT fault cases: count-equal, argsort, pad OPs (#16521) 2025-10-12 21:53:35 +08:00
cpy.cpp
cpy.hpp
dequantize.hpp
dmmv.cpp
dmmv.hpp
element_wise.cpp sycl : unify unary kernels with a generic implementation and enable wide operator support (#17213) 2025-11-16 00:52:42 +01:00
element_wise.hpp SYCL: Add support for FLOOR,CEIL,ROUND and TRUNC unary operators (#16613) 2025-10-20 11:08:32 +03:00
gemm.hpp
getrows.cpp
getrows.hpp
ggml-sycl.cpp sycl : unify unary kernels with a generic implementation and enable wide operator support (#17213) 2025-11-16 00:52:42 +01:00
gla.cpp
gla.hpp
im2col.cpp
im2col.hpp
mmq.cpp
mmq.hpp
mmvq.cpp
mmvq.hpp
norm.cpp sycl: add RMS_NORM_BACK operation support (#16808) 2025-10-29 14:14:39 +08:00
norm.hpp sycl: add RMS_NORM_BACK operation support (#16808) 2025-10-29 14:14:39 +08:00
outprod.cpp
outprod.hpp
pad.cpp [SYCL] fix UT fault cases: count-equal, argsort, pad OPs (#16521) 2025-10-12 21:53:35 +08:00
pad.hpp [SYCL] fix UT fault cases: count-equal, argsort, pad OPs (#16521) 2025-10-12 21:53:35 +08:00
pad_reflect_1d.cpp sycl : add PAD_REFLECT_D1 operator support (#16145) 2025-10-21 00:21:12 +02:00
pad_reflect_1d.hpp sycl : add PAD_REFLECT_D1 operator support (#16145) 2025-10-21 00:21:12 +02:00
presets.hpp SYCL SET operator optimized for F32 tensors (#16350) 2025-10-17 10:36:40 +08:00
quantize.hpp
quants.hpp
repeat_back.cpp SYCL: optimized repeat_back kernel (3× fewer asm instructions, 2× faster)Feature/sycl repeat back opt (#16869) 2025-11-03 09:35:33 +08:00
repeat_back.hpp sycl: add REPEAT_BACK operation support (#16734) 2025-10-27 09:19:50 +08:00
roll.cpp sycl: add ROLL operation support (#16665) 2025-10-27 09:20:24 +08:00
roll.hpp sycl: add ROLL operation support (#16665) 2025-10-27 09:20:24 +08:00
rope.cpp model: add support for qwen3vl series (#16780) 2025-10-30 16:19:14 +01:00
rope.hpp
set.cpp SYCL SET operator optimized for F32 tensors (#16350) 2025-10-17 10:36:40 +08:00
set.hpp SYCL SET operator optimized for F32 tensors (#16350) 2025-10-17 10:36:40 +08:00
set_rows.cpp ggml : implement set_rows with i32 index (#16159) 2025-09-22 19:13:00 +02:00
set_rows.hpp
softmax.cpp [SYCL] refactor soft_max, add soft_max_back (#16472) 2025-10-09 10:25:11 +03:00
softmax.hpp [SYCL] refactor soft_max, add soft_max_back (#16472) 2025-10-09 10:25:11 +03:00
ssm_conv.cpp sycl: add SSM_CONV operation support (#16800) 2025-10-28 09:50:33 +08:00
ssm_conv.hpp sycl: add SSM_CONV operation support (#16800) 2025-10-28 09:50:33 +08:00
sycl_hw.cpp
sycl_hw.hpp
tsembd.cpp
tsembd.hpp
vecdotq.hpp
wkv.cpp
wkv.hpp