llama.cpp

Commit Graph

Author	SHA1	Message	Date
hongruichen	30b40006cc	remove unused declarations	2024-07-14 23:50:11 +08:00
hongruichen	148ceab70c	add log op	2024-07-14 23:00:50 +08:00
hongruichen	100ccd5e7f	add unary op template and more ops	2024-07-13 00:55:34 +08:00
hongruichen	e3aa43adbd	suppress warning	2024-07-12 23:26:11 +08:00
hongruichen	f0894d897a	wip wip	2024-07-12 19:57:34 +08:00
hongruichen	be3aa9631f	use template function directly	2024-07-11 11:18:06 +08:00
hongruichen	8932135fdb	add sqrt and mul ops	2024-07-11 00:08:08 +08:00
hongruichen	7ea28a6fac	add helper function for binary op	2024-07-10 23:39:03 +08:00
hongruichen	b6f29273f0	add function to get graph from cache	2024-07-10 23:08:32 +08:00
hongruichen	80051cfc4d	remove unused variables	2024-07-10 19:57:47 +08:00
hongruichen	b49b501e26	fix sprintf type	2024-07-10 19:48:57 +08:00
hongruichen	3feb574bf0	merge register_rpc_mem into alloc_rpc_mem	2024-07-10 19:40:02 +08:00
hongruichen	e97d3a6c48	fix tensor buffer allocation add log commit qnn buffer after changed add log register_rpc_mem 2 times update input tensors before graph finalize default to QNN_TENSORMEMTYPE_RAW set new tensors at execute move write input tensors to exec check if mem registered before actual do register rpc mem once allocated	2024-07-10 19:32:39 +08:00
hongruichen	dc7d83e121	add log	2024-07-10 00:33:23 +08:00
hongruichen	9add256efe	use helper function instead	2024-07-10 00:31:39 +08:00
hongruichen	a7be0693ba	add log	2024-07-10 00:29:43 +08:00
hongruichen	af869fd636	fix compiling error in debug build	2024-07-10 00:23:51 +08:00
Hongrui Chen	5f2e3918f6	refactoring ggml_qnn_tensor	2024-07-09 19:58:46 +08:00
Hongrui Chen	874216b9c8	remove unused members	2024-07-07 22:32:43 +08:00
hongruichen	263ffa962e	small opt of the qnn graph config init	2024-07-05 23:07:27 +08:00
hongruichen	4b0f6b0cd6	add helper function to get Qnn_TensorType_t from ggml_tensor	2024-07-05 19:37:58 +08:00
hongruichen	0f2e68713c	move tensor related function to utils	2024-07-05 19:02:38 +08:00
hongruichen	58cec14092	reformat	2024-07-05 17:38:54 +08:00
hongruichen	13dc3a02c3	use qnn graph inside add and mul ops	2024-07-05 13:27:16 +08:00
hongruichen	a688ed324b	add op param to add_nodes	2024-07-05 13:07:48 +08:00
hongruichen	4b2ee61f62	move graph map to backend object	2024-07-05 11:58:47 +08:00
hongruichen	ca0d999c2a	add ggml_qnn_graph	2024-07-05 11:35:18 +08:00
hongruichen	000240cf62	add clang format file and reformating	2024-07-04 23:29:31 +08:00
hongruichen	38f88d5fb1	fix compiling error after merge latest master	2024-07-03 00:13:53 +08:00
hongruichen	8b677d1b2f	move qnn backend into sub folder	2024-07-02 19:42:14 +08:00
zhentaoyu	197fe6c1d7	[SYCL] Update SYCL-Rope op and Refactor (#8157 ) * align with rope.cu and move sycl-op to a single file	2024-07-01 19:39:06 +08:00
Johannes Gäßler	85a267daaa	CUDA: fix MMQ stream-k for --split-mode row (#8167 )	2024-06-27 16:26:05 +02:00
slaren	31ec3993f6	ggml : add GGML_CUDA_USE_GRAPHS option, restore GGML_CUDA_FORCE_CUBLAS (cmake) (#8140 )	2024-06-26 21:34:14 +02:00
Georgi Gerganov	f3f65429c4	llama : reorganize source code + improve CMake (#8006 ) * scripts : update sync [no ci] * files : relocate [no ci] * ci : disable kompute build [no ci] * cmake : fixes [no ci] * server : fix mingw build ggml-ci * cmake : minor [no ci] * cmake : link math library [no ci] * cmake : build normal ggml library (not object library) [no ci] * cmake : fix kompute build ggml-ci * make,cmake : fix LLAMA_CUDA + replace GGML_CDEF_PRIVATE ggml-ci * move public backend headers to the public include directory (#8122) * move public backend headers to the public include directory * nix test * spm : fix metal header --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> * scripts : fix sync paths [no ci] * scripts : sync ggml-blas.h [no ci] --------- Co-authored-by: slaren <slarengh@gmail.com>	2024-06-26 18:33:02 +03:00

34 Commits