hongruichen
30b40006cc
remove unused declarations
2024-07-14 23:50:11 +08:00
hongruichen
148ceab70c
add log op
2024-07-14 23:00:50 +08:00
hongruichen
100ccd5e7f
add unary op template and more ops
2024-07-13 00:55:34 +08:00
hongruichen
e3aa43adbd
suppress warning
2024-07-12 23:26:11 +08:00
hongruichen
f0894d897a
wip
...
wip
2024-07-12 19:57:34 +08:00
hongruichen
be3aa9631f
use template function directly
2024-07-11 11:18:06 +08:00
hongruichen
8932135fdb
add sqrt and mul ops
2024-07-11 00:08:08 +08:00
hongruichen
7ea28a6fac
add helper function for binary op
2024-07-10 23:39:03 +08:00
hongruichen
b6f29273f0
add function to get graph from cache
2024-07-10 23:08:32 +08:00
hongruichen
80051cfc4d
remove unused variables
2024-07-10 19:57:47 +08:00
hongruichen
b49b501e26
fix sprintf type
2024-07-10 19:48:57 +08:00
hongruichen
3feb574bf0
merge register_rpc_mem into alloc_rpc_mem
2024-07-10 19:40:02 +08:00
hongruichen
e97d3a6c48
fix tensor buffer allocation
...
add log
commit qnn buffer after changed
add log
register_rpc_mem 2 times
update input tensors before graph finalize
default to QNN_TENSORMEMTYPE_RAW
set new tensors at execute
move write input tensors to exec
check if mem registered before actual do
register rpc mem once allocated
2024-07-10 19:32:39 +08:00
hongruichen
dc7d83e121
add log
2024-07-10 00:33:23 +08:00
hongruichen
9add256efe
use helper function instead
2024-07-10 00:31:39 +08:00
hongruichen
a7be0693ba
add log
2024-07-10 00:29:43 +08:00
hongruichen
af869fd636
fix compiling error in debug build
2024-07-10 00:23:51 +08:00
Hongrui Chen
5f2e3918f6
refactoring ggml_qnn_tensor
2024-07-09 19:58:46 +08:00
Hongrui Chen
874216b9c8
remove unused members
2024-07-07 22:32:43 +08:00
hongruichen
263ffa962e
small opt of the qnn graph config init
2024-07-05 23:07:27 +08:00
hongruichen
4b0f6b0cd6
add helper function to get Qnn_TensorType_t from ggml_tensor
2024-07-05 19:37:58 +08:00
hongruichen
0f2e68713c
move tensor related function to utils
2024-07-05 19:02:38 +08:00
hongruichen
58cec14092
reformat
2024-07-05 17:38:54 +08:00
hongruichen
13dc3a02c3
use qnn graph inside add and mul ops
2024-07-05 13:27:16 +08:00
hongruichen
a688ed324b
add op param to add_nodes
2024-07-05 13:07:48 +08:00
hongruichen
4b2ee61f62
move graph map to backend object
2024-07-05 11:58:47 +08:00
hongruichen
ca0d999c2a
add ggml_qnn_graph
2024-07-05 11:35:18 +08:00
hongruichen
000240cf62
add clang format file and reformating
2024-07-04 23:29:31 +08:00
hongruichen
38f88d5fb1
fix compiling error after merge latest master
2024-07-03 00:13:53 +08:00
hongruichen
8b677d1b2f
move qnn backend into sub folder
2024-07-02 19:42:14 +08:00
zhentaoyu
197fe6c1d7
[SYCL] Update SYCL-Rope op and Refactor (#8157)
...
* align with rope.cu and move sycl-op to a single file
2024-07-01 19:39:06 +08:00
Johannes Gäßler
85a267daaa
CUDA: fix MMQ stream-k for --split-mode row (#8167)
2024-06-27 16:26:05 +02:00
slaren
31ec3993f6
ggml : add GGML_CUDA_USE_GRAPHS option, restore GGML_CUDA_FORCE_CUBLAS (cmake) (#8140)
2024-06-26 21:34:14 +02:00
Georgi Gerganov
f3f65429c4
llama : reorganize source code + improve CMake (#8006)
...
* scripts : update sync [no ci]
* files : relocate [no ci]
* ci : disable kompute build [no ci]
* cmake : fixes [no ci]
* server : fix mingw build
ggml-ci
* cmake : minor [no ci]
* cmake : link math library [no ci]
* cmake : build normal ggml library (not object library) [no ci]
* cmake : fix kompute build
ggml-ci
* make,cmake : fix LLAMA_CUDA + replace GGML_CDEF_PRIVATE
ggml-ci
* move public backend headers to the public include directory (#8122)
* move public backend headers to the public include directory
* nix test
* spm : fix metal header
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
* scripts : fix sync paths [no ci]
* scripts : sync ggml-blas.h [no ci]
---------
Co-authored-by: slaren <slarengh@gmail.com>
2024-06-26 18:33:02 +03:00