llama.cpp

History

chraac c2fe8a12bb ggml-hexagon: streamline flash attention operations by removing redundant checks for FP32		2026-02-03 00:14:51 +08:00
..
htp	ggml-hexagon: streamline flash attention operations by removing redundant checks for FP32	2026-02-03 00:14:51 +08:00
CMakeLists.txt	ggml-hexagon: flash-attention and reduce-sum optimizations (#19141 )	2026-01-30 21:14:20 -08:00
ggml-hexagon.cpp	hexagon: enable offloading to Hexagon on Windows on Snapdragon (#19150 )	2026-01-29 12:33:21 -08:00
htp-drv.cpp	hexagon: enable offloading to Hexagon on Windows on Snapdragon (#19150 )	2026-01-29 12:33:21 -08:00
htp-drv.h	hexagon: enable offloading to Hexagon on Windows on Snapdragon (#19150 )	2026-01-29 12:33:21 -08:00
libdl.h	hexagon: enable offloading to Hexagon on Windows on Snapdragon (#19150 )	2026-01-29 12:33:21 -08:00
libggml-htp.inf	hexagon: enable offloading to Hexagon on Windows on Snapdragon (#19150 )	2026-01-29 12:33:21 -08:00
op-desc.h	ggml-hexagon: create generalized functions for cpu side op (#17500 )	2025-12-22 23:13:24 -08:00