llama.cpp/ggml/src
Jeff Bolz b3ad3a0191
vulkan: support SET_ROWS (#14587)
* vulkan: support SET_ROWS

Add variants of the copy_to_quant shader that do the SET_ROWS operation.
Change these shaders to spread the work across the workgroup.
The memory access pattern is probably not great (one thread per quant block),
but should be fine for now.

* vulkan: optimize set_rows

Larger workgroups for non-quant types.
Set "norepeat" (there is manual repeat logic).
Use fastmod.
2025-07-12 12:12:26 +02:00
..
ggml-blas cmake : Fix broken CMake error messages (ggml/1252) 2025-06-01 13:43:57 +03:00
ggml-cann ggml : add ggml_scale_bias (#14417) 2025-07-09 18:16:12 +02:00
ggml-cpu ggml : add ggml_scale_bias (#14417) 2025-07-09 18:16:12 +02:00
ggml-cuda model : support LiquidAI LFM2 hybrid family (#14620) 2025-07-11 20:27:01 +02:00
ggml-hip HIP: disable rocwmma on gfx12 by default until rocm 7.0 (#14202) 2025-06-16 13:47:38 +02:00
ggml-metal ggml : add ggml_scale_bias (#14417) 2025-07-09 18:16:12 +02:00
ggml-musa musa: enable fp16 mma (all) and cublas on qy2 (#13842) 2025-06-26 12:11:59 +08:00
ggml-opencl opencl: add tiled mul_mat_f16_f32 (#14535) 2025-07-10 14:58:12 -07:00
ggml-rpc rpc : nicer error messages for RPC server crash (#14076) 2025-06-10 09:41:01 +03:00
ggml-sycl SYCL: Initial set_rows kernel implementation (#14562) 2025-07-10 09:29:38 +01:00
ggml-vulkan vulkan: support SET_ROWS (#14587) 2025-07-12 12:12:26 +02:00
CMakeLists.txt ggml : remove kompute backend (#14501) 2025-07-03 07:48:32 +03:00
ggml-alloc.c ggml: Don't assert fail when tensor data changes (#13222) 2025-05-01 22:46:10 +02:00
ggml-backend-impl.h ggml : upgrade init_tensor API to return a ggml_status (#11854) 2025-02-28 14:41:47 +01:00
ggml-backend-reg.cpp ggml : remove kompute backend (#14501) 2025-07-03 07:48:32 +03:00
ggml-backend.cpp vulkan: Add fusion support for RMS_NORM+MUL (#14366) 2025-06-29 09:43:36 +02:00
ggml-common.h ggml-cpu : split arch-specific implementations (#13892) 2025-06-09 16:47:13 +02:00
ggml-impl.h vulkan: Add fusion support for RMS_NORM+MUL (#14366) 2025-06-29 09:43:36 +02:00
ggml-opt.cpp mnist: fix segmentation fault (ggml/1227) 2025-05-19 13:29:56 +03:00
ggml-quants.c ggml-quants : rename best_mad to best_error (ggml/1283) 2025-07-01 11:06:39 +03:00
ggml-quants.h ggml : build backends as libraries (#10256) 2024-11-14 18:04:35 +01:00
ggml-threading.cpp ggml : build backends as libraries (#10256) 2024-11-14 18:04:35 +01:00
ggml-threading.h remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797) 2024-12-12 19:02:49 +01:00
ggml.c ggml : add ggml_scale_bias (#14417) 2025-07-09 18:16:12 +02:00
ggml.cpp ggml : Print backtrace on uncaught C++ exceptions (ggml/1232) 2025-06-01 13:43:57 +03:00
gguf.cpp ggml : prevent integer overflow in gguf tensor size calculation (#14595) 2025-07-09 14:33:53 +02:00