llama.cpp

History

lhez 9c96465f99 opencl: enable the general fp mm for non-cont input and as a fallback for specialized kqv kernel for adreno (#18970 ) * opencl: add `copy_to_contiguous` and utilize mm kernels * opencl: only copy to cont for f32 and f16 tensors * opencl: use cont mm for fallback when dst is large * opencl: use nb local to copy-to-cont * opencl: use local offset as well		2026-01-22 10:29:25 -08:00
..
kernels	opencl: add TRI op support (#18979 )	2026-01-21 22:05:54 -08:00
CMakeLists.txt	opencl: add TRI op support (#18979 )	2026-01-21 22:05:54 -08:00
ggml-opencl.cpp	opencl: enable the general fp mm for non-cont input and as a fallback for specialized kqv kernel for adreno (#18970 )	2026-01-22 10:29:25 -08:00