Commit Graph

5 Commits

Author SHA1 Message Date
Georgi Gerganov 89961dea87
Merge branch 'master' into gg/flash-attn 2024-04-05 09:44:12 +03:00
Johannes Gäßler c63dfdf765 fix cmake build 2024-04-02 13:48:13 +03:00
Johannes Gäßler 81da919864 no vec for hs, no hs==256 ncols==32 for Volta 2024-04-02 13:48:13 +03:00
Georgi Gerganov d48ccf3ad4
sync : ggml (#6351)
* sync : ggml

ggml-ci

* cuda : move GGML_CUDA_DMMV constants to dmmv.cuh

---------

Co-authored-by: slaren <slarengh@gmail.com>
2024-03-29 17:45:46 +02:00
slaren ae1f211ce2
cuda : refactor into multiple files (#6269) 2024-03-25 13:50:23 +01:00