llama.cpp/ggml/src/ggml-opencl/kernels
lhez 6e6725459a
opencl: add `mul_mat_f32_f32_l4_lm` and `mul_mat_f16_f32_l4_lm` (#14809)
2025-07-30 14:56:55 -07:00
..
add.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
argsort.cl opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, `group_norm` (#13787) 2025-05-27 12:56:08 -07:00
clamp.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
concat.cl OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat (#13840) 2025-06-02 16:53:36 -07:00
conv2d.cl opencl: add conv2d kernel (#14403) 2025-07-21 10:03:19 -07:00
conv2d_f16_f32.cl opencl: add conv2d kernel (#14403) 2025-07-21 10:03:19 -07:00
cpy.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
cvt.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
diag_mask_inf.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
div.cl opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, `group_norm` (#13787) 2025-05-27 12:56:08 -07:00
embed_kernel.py Introducing experimental OpenCL backend with support for Qualcomm Adreno GPUs (#10693) 2024-12-13 12:23:52 -08:00
gelu.cl opencl: add GELU_ERF (#14476) 2025-07-04 23:24:56 -07:00
gemv_noshuffle.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
gemv_noshuffle_general.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
get_rows.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
glu.cl ggml : implement GEGLU_ERF and GEGLU_QUICK ops (#14445) 2025-07-03 23:07:22 +02:00
group_norm.cl opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, `group_norm` (#13787) 2025-05-27 12:56:08 -07:00
im2col_f16.cl opencl: fix `im2col` when `KW!=KH` (#14803) 2025-07-21 13:55:10 -07:00
im2col_f32.cl opencl: fix `im2col` when `KW!=KH` (#14803) 2025-07-21 13:55:10 -07:00
mul.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
mul_mat_Ab_Bi_8x4.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
mul_mat_f16_f32.cl opencl: add tiled mul_mat_f16_f32 (#14535) 2025-07-10 14:58:12 -07:00
mul_mm_f16_f32_l4_lm.cl opencl: add `mul_mat_f32_f32_l4_lm` and `mul_mat_f16_f32_l4_lm` (#14809) 2025-07-30 14:56:55 -07:00
mul_mm_f32_f32_l4_lm.cl opencl: add `mul_mat_f32_f32_l4_lm` and `mul_mat_f16_f32_l4_lm` (#14809) 2025-07-30 14:56:55 -07:00
mul_mv_f16_f16.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
mul_mv_f16_f32.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
mul_mv_f16_f32_1row.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
mul_mv_f16_f32_l4.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
mul_mv_f32_f32.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
mul_mv_id_q4_0_f32_8x_flat.cl opencl: add `mul_mv_id_q4_0_f32_8x_flat` (#14003) 2025-06-10 16:55:58 -07:00
mul_mv_q4_0_f32.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
mul_mv_q4_0_f32_1d_8x_flat.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
mul_mv_q4_0_f32_1d_16x_flat.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
mul_mv_q4_0_f32_8x_flat.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
mul_mv_q4_0_f32_v.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
mul_mv_q6_k.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
norm.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
pad.cl OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat (#13840) 2025-06-02 16:53:36 -07:00
relu.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
repeat.cl OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat (#13840) 2025-06-02 16:53:36 -07:00
rms_norm.cl opencl: add fused `rms_norm_mul` (#14841) 2025-07-25 17:12:13 +02:00
rope.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
scale.cl ggml : add ggml_scale_bias (#14417) 2025-07-09 18:16:12 +02:00
set_rows.cl opencl: add `set_rows` for `f16` and `f32` (#14547) 2025-07-10 11:48:52 -07:00
sigmoid.cl opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, `group_norm` (#13787) 2025-05-27 12:56:08 -07:00
silu.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
softmax_4_f16.cl opencl : broadcast for soft_max (#14510) 2025-07-03 20:22:24 +02:00
softmax_4_f32.cl opencl : broadcast for soft_max (#14510) 2025-07-03 20:22:24 +02:00
softmax_f16.cl opencl : broadcast for soft_max (#14510) 2025-07-03 20:22:24 +02:00
softmax_f32.cl opencl : broadcast for soft_max (#14510) 2025-07-03 20:22:24 +02:00
sub.cl opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, `group_norm` (#13787) 2025-05-27 12:56:08 -07:00
sum_rows.cl opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, `group_norm` (#13787) 2025-05-27 12:56:08 -07:00
tanh.cl OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat (#13840) 2025-06-02 16:53:36 -07:00
transpose.cl opencl: split `ggml-opencl.cl` into multiple files and cleanup (#12886) 2025-04-15 12:26:00 -07:00
tsembd.cl OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat (#13840) 2025-06-02 16:53:36 -07:00
upscale.cl opencl : update upscale to support align corners (#14488) 2025-07-02 09:07:42 +02:00