Since the prefill length is not fixed, graphs constructed for the prefill stage cannot be reused. For this reason, ACL graph execution is disabled by default during prefill. |
||
|---|---|---|
| .. | ||
| BLIS.md | ||
| CANN.md | ||
| CUDA-FEDORA.md | ||
| OPENCL.md | ||
| SYCL.md | ||
Since the prefill length is not fixed, graphs constructed for the prefill stage cannot be reused. For this reason, ACL graph execution is disabled by default during prefill. |
||
|---|---|---|
| .. | ||
| BLIS.md | ||
| CANN.md | ||
| CUDA-FEDORA.md | ||
| OPENCL.md | ||
| SYCL.md | ||