llama.cpp

History

Jeff Bolz c6c5e85979 vulkan: support solve_tri with larger N/K values (#17781 ) Split N into chunks to fit into shared memory. If K > 128, use a larger workgroup with enough invocations. Add perf tests matching qwen3next.		2025-12-06 08:56:45 +01:00
..
cmake	ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (#15094 )	2025-08-07 13:45:41 +02:00
include	rpc : fix alloc size logic (#17116 )	2025-12-05 19:39:04 +02:00
src	vulkan: support solve_tri with larger N/K values (#17781 )	2025-12-06 08:56:45 +01:00
.gitignore	vulkan : cmake integration (#8119 )	2024-07-13 18:12:39 +02:00
CMakeLists.txt	build : move _WIN32_WINNT definition to headers (#17736 )	2025-12-04 07:04:02 +01:00