llama.cpp/ggml/src/vulkan-shaders
0cc4m a3738b2fa7 vulkan : implement Stable Diffusion operators (ggml/904)
* Fix Vulkan repeat op
* Implement Vulkan concat op
* Delete old Vulkan shader generator
* Implement Vulkan im2col op
* Implement Vulkan unary gelu_quick op
* Implement Vulkan group_norm op
* Implement Vulkan timestep_embedding op
* Implement Vulkan upscale op
* Fix Vulkan vk_context tensor extra index issue
* Fix Vulkan matmul shader parameter bug
* Properly fix Vulkan matmul shader parameter bug
* Add Vulkan ADD f16 + f32 -> f16 operator support
* Implement Vulkan tanh op
* Fix Vulkan group count too large Validation error on non-Nvidia GPUs
* Throw error when too much memory is requested
* Fix another Vulkan group count too large Validation error on non-Nvidia GPUs
* Fix matmul MMQ condition
* Implement Vulkan pad op
* Fix Vulkan crash when tensor is used multiple times in a compute graph
* Add Vulkan CONCAT f16 + f16 -> f16 op
* Add Vulkan LEAKY_RELU op
2024-08-05 08:50:57 +03:00
CMakeLists.txt vulkan : cmake integration (#8119) 2024-07-13 18:12:39 +02:00
add.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
argsort.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
clamp.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
concat.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
copy.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
dequant_f32.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
dequant_funcs.comp Vulkan IQ4_NL Support (#8613) 2024-07-23 10:56:49 +02:00
dequant_head.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
dequant_iq4_nl.comp Vulkan IQ4_NL Support (#8613) 2024-07-23 10:56:49 +02:00
dequant_q2_k.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
dequant_q3_k.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
dequant_q4_0.comp Vulkan IQ4_NL Support (#8613) 2024-07-23 10:56:49 +02:00
dequant_q4_1.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
dequant_q4_k.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
dequant_q5_0.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
dequant_q5_1.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
dequant_q5_k.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
dequant_q6_k.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
dequant_q8_0.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
diag_mask_inf.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
div.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
gelu.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
gelu_quick.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
generic_binary_head.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
generic_head.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
generic_unary_head.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
get_rows.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
get_rows_quant.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
group_norm.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
im2col.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
leaky_relu.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
mul.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
mul_mat_split_k_reduce.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
mul_mat_vec.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
mul_mat_vec_base.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
mul_mat_vec_nc.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
mul_mat_vec_p021.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
mul_mat_vec_q2_k.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
mul_mat_vec_q3_k.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
mul_mat_vec_q4_k.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
mul_mat_vec_q5_k.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
mul_mat_vec_q6_k.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
mul_mm.comp Vulkan IQ4_NL Support (#8613) 2024-07-23 10:56:49 +02:00
norm.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
pad.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
relu.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
rms_norm.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
rope_head.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
rope_neox.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
rope_norm.comp llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
scale.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
silu.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
soft_max.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
square.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
sum_rows.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
tanh.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
timestep_embedding.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
types.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
upscale.comp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00
vulkan-shaders-gen.cpp vulkan : implement Stable Diffusion operators (ggml/904) 2024-08-05 08:50:57 +03:00