llama.cpp

shalinib-ibm 55c509daf5 ggml : refactor llamafile_sgemm PPC code (#14673)		2025-07-14 16:16:42 +03:00
- Remove unnecessary templates from class definition and packing functions
- Reduce deeply nested conditionals and if-else switching in the mnpack function
- Replace repetitive code with inline functions in packing functions
- 2 ~ 7% improvement in Q8 model
- 15 ~ 50% improvement in Q4 model
Signed-off-by: Shalini Salomi Bodapati <Shalini.Salomi.Bodapati@ibm.com>
cmake	ggml-cpu : rework weak alias on apple targets (#14146)	2025-06-16 13:54:15 +08:00
include	ggml : add ggml_scale_bias (#14417)	2025-07-09 18:16:12 +02:00
src	ggml : refactor llamafile_sgemm PPC code (#14673)	2025-07-14 16:16:42 +03:00
.gitignore	vulkan : cmake integration (#8119)	2024-07-13 18:12:39 +02:00
CMakeLists.txt	ggml : remove kompute backend (#14501)	2025-07-03 07:48:32 +03:00