llama.cpp/ggml/src/ggml-opencl
lhez ece0f5c177
opencl: add fastdiv and use it in set_rows, ported from cuda (#17090)
* opencl: add fastdiv for mm q8_0

* opencl: use uint4 for fastdiv vals

* opencl: use fastdiv for set_rows

* opencl: do not use fastdiv for q8_0 mm
2025-11-10 15:00:13 -08:00
..
kernels opencl: add fastdiv and use it in set_rows, ported from cuda (#17090) 2025-11-10 15:00:13 -08:00
CMakeLists.txt opencl: transposed gemm/gemv moe kernel with mxfp4,f32 (#16602) 2025-10-17 17:55:32 -07:00
ggml-opencl.cpp opencl: add fastdiv and use it in set_rows, ported from cuda (#17090) 2025-11-10 15:00:13 -08:00