Georgi Gerganov | 89961dea87 | Merge branch 'master' into gg/flash-attn | 2024-04-05 09:44:12 +03:00
Johannes Gäßler | c63dfdf765 | fix cmake build | 2024-04-02 13:48:13 +03:00
Johannes Gäßler | 81da919864 | no vec for hs, no hs==256 ncols==32 for Volta | 2024-04-02 13:48:13 +03:00
Georgi Gerganov | d48ccf3ad4 | sync : ggml (#6351); * sync : ggml; ggml-ci; * cuda : move GGML_CUDA_DMMV constants to dmmv.cuh; Co-authored-by: slaren <slarengh@gmail.com> | 2024-03-29 17:45:46 +02:00
slaren | ae1f211ce2 | cuda : refactor into multiple files (#6269) | 2024-03-25 13:50:23 +01:00