llama.cpp

History

Jeff Bolz b3ad3a0191 vulkan: support SET_ROWS (#14587 ) * vulkan: support SET_ROWS Add variants of the copy_to_quant shader that do the SET_ROWS operation. Change these shaders to spread the work across the workgroup. The memory access pattern is probably not great (one thread per quant block), but should be fine for now. * vulkan: optimize set_rows Larger workgroups for non-quant types. Set "norepeat" (there is manual repeat logic). Use fastmod.		2025-07-12 12:12:26 +02:00
..
ggml-blas	cmake : Fix broken CMake error messages (ggml/1252)	2025-06-01 13:43:57 +03:00
ggml-cann	ggml : add ggml_scale_bias (#14417 )	2025-07-09 18:16:12 +02:00
ggml-cpu	ggml : add ggml_scale_bias (#14417 )	2025-07-09 18:16:12 +02:00
ggml-cuda	model : support LiquidAI LFM2 hybrid family (#14620 )	2025-07-11 20:27:01 +02:00
ggml-hip	HIP: disable rocwmma on gfx12 by default until rocm 7.0 (#14202 )	2025-06-16 13:47:38 +02:00
ggml-metal	ggml : add ggml_scale_bias (#14417 )	2025-07-09 18:16:12 +02:00
ggml-musa	musa: enable fp16 mma (all) and cublas on qy2 (#13842 )	2025-06-26 12:11:59 +08:00
ggml-opencl	opencl: add tiled mul_mat_f16_f32 (#14535 )	2025-07-10 14:58:12 -07:00
ggml-rpc	rpc : nicer error messages for RPC server crash (#14076 )	2025-06-10 09:41:01 +03:00
ggml-sycl	SYCL: Initial set_rows kernel implementation (#14562 )	2025-07-10 09:29:38 +01:00
ggml-vulkan	vulkan: support SET_ROWS (#14587 )	2025-07-12 12:12:26 +02:00
CMakeLists.txt	ggml : remove kompute backend (#14501 )	2025-07-03 07:48:32 +03:00
ggml-alloc.c	ggml: Don't assert fail when tensor data changes (#13222 )	2025-05-01 22:46:10 +02:00
ggml-backend-impl.h	ggml : upgrade init_tensor API to return a ggml_status (#11854 )	2025-02-28 14:41:47 +01:00
ggml-backend-reg.cpp	ggml : remove kompute backend (#14501 )	2025-07-03 07:48:32 +03:00
ggml-backend.cpp	vulkan: Add fusion support for RMS_NORM+MUL (#14366 )	2025-06-29 09:43:36 +02:00
ggml-common.h	ggml-cpu : split arch-specific implementations (#13892 )	2025-06-09 16:47:13 +02:00
ggml-impl.h	vulkan: Add fusion support for RMS_NORM+MUL (#14366 )	2025-06-29 09:43:36 +02:00
ggml-opt.cpp	mnist: fix segmentation fault (ggml/1227)	2025-05-19 13:29:56 +03:00
ggml-quants.c	ggml-quants : rename best_mad to best_error (ggml/1283)	2025-07-01 11:06:39 +03:00
ggml-quants.h	ggml : build backends as libraries (#10256 )	2024-11-14 18:04:35 +01:00
ggml-threading.cpp	ggml : build backends as libraries (#10256 )	2024-11-14 18:04:35 +01:00
ggml-threading.h	remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797 )	2024-12-12 19:02:49 +01:00
ggml.c	ggml : add ggml_scale_bias (#14417 )	2025-07-09 18:16:12 +02:00
ggml.cpp	ggml : Print backtrace on uncaught C++ exceptions (ggml/1232)	2025-06-01 13:43:57 +03:00
gguf.cpp	ggml : prevent integer overflow in gguf tensor size calculation (#14595 )	2025-07-09 14:33:53 +02:00