This PR enables concurrent streams introduced in #16991 by default. To disable a new env flag `GGML_CUDA_DISABLE_GRAPH_OPT` is introduced |
||
|---|---|---|
| .. | ||
| cmake | ||
| include | ||
| src | ||
| .gitignore | ||
| CMakeLists.txt | ||
This PR enables concurrent streams introduced in #16991 by default. To disable a new env flag `GGML_CUDA_DISABLE_GRAPH_OPT` is introduced |
||
|---|---|---|
| .. | ||
| cmake | ||
| include | ||
| src | ||
| .gitignore | ||
| CMakeLists.txt | ||