llama.cpp/ggml/src/ggml-opencl
lhez 9c96465f99
opencl: enable the general fp mm for non-cont input and as a fallback for specialized kqv kernel for adreno (#18970)
* opencl: add `copy_to_contiguous` and utilize mm kernels

* opencl: only copy to cont for f32 and f16 tensors

* opencl: use cont mm for fallback when dst is large

* opencl: use nb local to copy-to-cont

* opencl: use local offset as well
2026-01-22 10:29:25 -08:00
..
kernels opencl: add TRI op support (#18979) 2026-01-21 22:05:54 -08:00
CMakeLists.txt opencl: add TRI op support (#18979) 2026-01-21 22:05:54 -08:00
ggml-opencl.cpp opencl: enable the general fp mm for non-cont input and as a fallback for specialized kqv kernel for adreno (#18970) 2026-01-22 10:29:25 -08:00