Commit Graph

34 Commits

Author SHA1 Message Date
hongruichen 30b40006cc remove unused declarations 2024-07-14 23:50:11 +08:00
hongruichen 148ceab70c add log op 2024-07-14 23:00:50 +08:00
hongruichen 100ccd5e7f add unary op template and more ops 2024-07-13 00:55:34 +08:00
hongruichen e3aa43adbd suppress warning 2024-07-12 23:26:11 +08:00
hongruichen f0894d897a wip
wip
2024-07-12 19:57:34 +08:00
hongruichen be3aa9631f use template function directly 2024-07-11 11:18:06 +08:00
hongruichen 8932135fdb add sqrt and mul ops 2024-07-11 00:08:08 +08:00
hongruichen 7ea28a6fac add helper function for binary op 2024-07-10 23:39:03 +08:00
hongruichen b6f29273f0 add function to get graph from cache 2024-07-10 23:08:32 +08:00
hongruichen 80051cfc4d remove unused variables 2024-07-10 19:57:47 +08:00
hongruichen b49b501e26 fix sprintf type 2024-07-10 19:48:57 +08:00
hongruichen 3feb574bf0 merge register_rpc_mem into alloc_rpc_mem 2024-07-10 19:40:02 +08:00
hongruichen e97d3a6c48 fix tensor buffer allocation
add log

commit qnn buffer after changed

add log

register_rpc_mem 2 times

update input tensors before graph finalize

default to QNN_TENSORMEMTYPE_RAW

set new tensors at execute

move write input tensors to exec

check if mem registered before actual do

register rpc mem once allocated
2024-07-10 19:32:39 +08:00
hongruichen dc7d83e121 add log 2024-07-10 00:33:23 +08:00
hongruichen 9add256efe use helper function instead 2024-07-10 00:31:39 +08:00
hongruichen a7be0693ba add log 2024-07-10 00:29:43 +08:00
hongruichen af869fd636 fix compiling error in debug build 2024-07-10 00:23:51 +08:00
Hongrui Chen 5f2e3918f6 refactoring ggml_qnn_tensor 2024-07-09 19:58:46 +08:00
Hongrui Chen 874216b9c8 remove unused members 2024-07-07 22:32:43 +08:00
hongruichen 263ffa962e small opt of the qnn graph config init 2024-07-05 23:07:27 +08:00
hongruichen 4b0f6b0cd6 add helper function to get Qnn_TensorType_t from ggml_tensor 2024-07-05 19:37:58 +08:00
hongruichen 0f2e68713c move tensor related function to utils 2024-07-05 19:02:38 +08:00
hongruichen 58cec14092 reformat 2024-07-05 17:38:54 +08:00
hongruichen 13dc3a02c3 use qnn graph inside add and mul ops 2024-07-05 13:27:16 +08:00
hongruichen a688ed324b add op param to add_nodes 2024-07-05 13:07:48 +08:00
hongruichen 4b2ee61f62 move graph map to backend object 2024-07-05 11:58:47 +08:00
hongruichen ca0d999c2a add ggml_qnn_graph 2024-07-05 11:35:18 +08:00
hongruichen 000240cf62 add clang format file and reformating 2024-07-04 23:29:31 +08:00
hongruichen 38f88d5fb1 fix compiling error after merge latest master 2024-07-03 00:13:53 +08:00
hongruichen 8b677d1b2f move qnn backend into sub folder 2024-07-02 19:42:14 +08:00
zhentaoyu 197fe6c1d7
[SYCL] Update SYCL-Rope op and Refactor (#8157)
* align with rope.cu and move sycl-op to a single file
2024-07-01 19:39:06 +08:00
Johannes Gäßler 85a267daaa
CUDA: fix MMQ stream-k for --split-mode row (#8167) 2024-06-27 16:26:05 +02:00
slaren 31ec3993f6
ggml : add GGML_CUDA_USE_GRAPHS option, restore GGML_CUDA_FORCE_CUBLAS (cmake) (#8140) 2024-06-26 21:34:14 +02:00
Georgi Gerganov f3f65429c4
llama : reorganize source code + improve CMake (#8006)
* scripts : update sync [no ci]

* files : relocate [no ci]

* ci : disable kompute build [no ci]

* cmake : fixes [no ci]

* server : fix mingw build

ggml-ci

* cmake : minor [no ci]

* cmake : link math library [no ci]

* cmake : build normal ggml library (not object library) [no ci]

* cmake : fix kompute build

ggml-ci

* make,cmake : fix LLAMA_CUDA + replace GGML_CDEF_PRIVATE

ggml-ci

* move public backend headers to the public include directory (#8122)

* move public backend headers to the public include directory

* nix test

* spm : fix metal header

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

* scripts : fix sync paths [no ci]

* scripts : sync ggml-blas.h [no ci]

---------

Co-authored-by: slaren <slarengh@gmail.com>
2024-06-26 18:33:02 +03:00