Commit Graph

4 Commits

Author SHA1 Message Date
slaren bf56fdecb3 cleanup 2024-04-17 18:37:28 +02:00
slaren ea2b79534e ggml : group all experts in a single ggml_mul_mat_id
cuda : improve mmid row copy
2024-04-05 15:17:18 +02:00
slaren 280345968d
cuda : rename build flag to LLAMA_CUDA (#6299) 2024-03-26 01:16:01 +01:00
Georgi Gerganov d2819d5577
scripts : add helpers script for bench comparing commits (#5521)
* scripts : add helpers script for bench comparing commits

* scripts : detect CUDA

* set flags after checking the command line

* fix make flags

---------

Co-authored-by: slaren <slarengh@gmail.com>
2024-02-16 15:14:40 +02:00