..
add.cl
opencl: allow mixed f16/f32 `add` ( #15140 )
2025-08-12 02:42:41 -07:00
add_id.cl
opencl: add `swiglu_oai` and `add_id` ( #15121 )
2025-08-06 12:12:17 -07:00
argsort.cl
opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, `group_norm` ( #13787 )
2025-05-27 12:56:08 -07:00
clamp.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
concat.cl
OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat ( #13840 )
2025-06-02 16:53:36 -07:00
conv2d.cl
opencl: add conv2d kernel ( #14403 )
2025-07-21 10:03:19 -07:00
conv2d_f16_f32.cl
opencl: add conv2d kernel ( #14403 )
2025-07-21 10:03:19 -07:00
cpy.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
cvt.cl
opencl: initial `q8_0` mv support ( #15732 )
2025-09-21 14:48:44 -07:00
diag_mask_inf.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
div.cl
opencl: add f16 for `add`, `sub`, `mul`, `div` ( #14984 )
2025-08-01 13:15:44 +02:00
embed_kernel.py
Introducing experimental OpenCL backend with support for Qualcomm Adreno GPUs ( #10693 )
2024-12-13 12:23:52 -08:00
flash_attn_f16.cl
opencl: add attn sinks support for FA kernels ( #15706 )
2025-09-01 23:26:53 -07:00
flash_attn_f32.cl
opencl: add attn sinks support for FA kernels ( #15706 )
2025-09-01 23:26:53 -07:00
flash_attn_f32_f16.cl
opencl: add attn sinks support for FA kernels ( #15706 )
2025-09-01 23:26:53 -07:00
gelu.cl
opencl: add GELU_ERF ( #14476 )
2025-07-04 23:24:56 -07:00
gemv_noshuffle.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
gemv_noshuffle_general.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
get_rows.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
glu.cl
opencl: add `swiglu_oai` and `add_id` ( #15121 )
2025-08-06 12:12:17 -07:00
group_norm.cl
OpenCL: add fused group_norm/norm, mul, add ( #15314 )
2025-08-26 23:36:05 -07:00
im2col_f16.cl
opencl: fix `im2col` when `KW!=KH` ( #14803 )
2025-07-21 13:55:10 -07:00
im2col_f32.cl
opencl: fix `im2col` when `KW!=KH` ( #14803 )
2025-07-21 13:55:10 -07:00
mul.cl
opencl: add f16 for `add`, `sub`, `mul`, `div` ( #14984 )
2025-08-01 13:15:44 +02:00
mul_mat_Ab_Bi_8x4.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
mul_mat_f16_f32.cl
opencl: add tiled mul_mat_f16_f32 ( #14535 )
2025-07-10 14:58:12 -07:00
mul_mm_f16_f32_l4_lm.cl
opencl: add `mul_mat_f32_f32_l4_lm` and `mul_mat_f16_f32_l4_lm` ( #14809 )
2025-07-30 14:56:55 -07:00
mul_mm_f32_f32_l4_lm.cl
opencl: add `mul_mat_f32_f32_l4_lm` and `mul_mat_f16_f32_l4_lm` ( #14809 )
2025-07-30 14:56:55 -07:00
mul_mv_f16_f16.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
mul_mv_f16_f32.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
mul_mv_f16_f32_1row.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
mul_mv_f16_f32_l4.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
mul_mv_f32_f32.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
mul_mv_id_mxfp4_f32.cl
opencl: add initial mxfp4 support via mv ( #15270 )
2025-08-15 09:52:14 -07:00
mul_mv_id_mxfp4_f32_flat.cl
opencl: optimize mxfp4 kernels ( #16037 )
2025-09-18 12:03:34 -07:00
mul_mv_id_q4_0_f32_8x_flat.cl
opencl: add `mul_mv_id_q4_0_f32_8x_flat` ( #14003 )
2025-06-10 16:55:58 -07:00
mul_mv_id_q8_0_f32.cl
opencl: initial `q8_0` mv support ( #15732 )
2025-09-21 14:48:44 -07:00
mul_mv_id_q8_0_f32_flat.cl
opencl: initial `q8_0` mv support ( #15732 )
2025-09-21 14:48:44 -07:00
mul_mv_mxfp4_f32.cl
opencl: add initial mxfp4 support via mv ( #15270 )
2025-08-15 09:52:14 -07:00
mul_mv_mxfp4_f32_flat.cl
opencl: optimize mxfp4 kernels ( #16037 )
2025-09-18 12:03:34 -07:00
mul_mv_q4_0_f32.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
mul_mv_q4_0_f32_1d_8x_flat.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
mul_mv_q4_0_f32_1d_16x_flat.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
mul_mv_q4_0_f32_8x_flat.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
mul_mv_q4_0_f32_v.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
mul_mv_q6_k.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
mul_mv_q8_0_f32.cl
opencl: initial `q8_0` mv support ( #15732 )
2025-09-21 14:48:44 -07:00
mul_mv_q8_0_f32_flat.cl
opencl: initial `q8_0` mv support ( #15732 )
2025-09-21 14:48:44 -07:00
norm.cl
OpenCL: add fused group_norm/norm, mul, add ( #15314 )
2025-08-26 23:36:05 -07:00
pad.cl
OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat ( #13840 )
2025-06-02 16:53:36 -07:00
relu.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
repeat.cl
OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat ( #13840 )
2025-06-02 16:53:36 -07:00
rms_norm.cl
opencl: add fused `rms_norm_mul` ( #14841 )
2025-07-25 17:12:13 +02:00
rope.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
scale.cl
ggml : add ggml_scale_bias ( #14417 )
2025-07-09 18:16:12 +02:00
set_rows.cl
ggml : implement set_rows with i32 index ( #16159 )
2025-09-22 19:13:00 +02:00
sigmoid.cl
opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, `group_norm` ( #13787 )
2025-05-27 12:56:08 -07:00
silu.cl
opencl: split `ggml-opencl.cl` into multiple files and cleanup ( #12886 )
2025-04-15 12:26:00 -07:00
softmax_4_f16.cl
opencl: support sink in `soft_max` (attn sinks) ( #15152 )
2025-08-07 21:47:03 -07:00
softmax_4_f32.cl
opencl: support sink in `soft_max` (attn sinks) ( #15152 )
2025-08-07 21:47:03 -07:00
softmax_f16.cl
opencl: support sink in `soft_max` (attn sinks) ( #15152 )
2025-08-07 21:47:03 -07:00
softmax_f32.cl
opencl: support sink in `soft_max` (attn sinks) ( #15152 )
2025-08-07 21:47:03 -07:00
sub.cl
opencl: add f16 for `add`, `sub`, `mul`, `div` ( #14984 )
2025-08-01 13:15:44 +02:00
sum_rows.cl
opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, `group_norm` ( #13787 )
2025-05-27 12:56:08 -07:00
tanh.cl
OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat ( #13840 )
2025-06-02 16:53:36 -07:00
transpose.cl
opencl: add initial mxfp4 support via mv ( #15270 )
2025-08-15 09:52:14 -07:00
tsembd.cl
ggml : fix padding in timestep embedding kernels ( #15932 )
2025-09-16 15:25:57 +02:00
upscale.cl
opencl : update upscale to support align corners ( #14488 )
2025-07-02 09:07:42 +02:00