- Add VK_VENDOR_ID_ARM (0x13B5) and VK_VENDOR_ID_QUALCOMM (0x5143).
- Disable the 'l_warptile' (Large) paths on ARM and Qualcomm to avoid register pressure and driver crashes.
- Explicitly disable cooperative matrix (fp16-matrix) on mobile GPUs to prevent crashes.
- Disable integer dot product (int8-matrix) on Qualcomm due to driver issues.
- Keep general FP16 enabled for large model support.
- Limit the suballocation block size to 256 MB to improve memory stability on Android.
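
The changes above can be sketched roughly as follows. This is a minimal illustration, not the actual patch: the `DeviceFeatures` struct and the helper names `is_mobile_vendor`, `apply_mobile_workarounds`, and `clamp_suballocation_block_size` are hypothetical. It also assumes that "mobile GPUs" means ARM and Qualcomm, that 256 MB means 256 * 1024 * 1024 bytes, and that the block-size cap applies regardless of vendor.

```cpp
#include <cassert>
#include <cstdint>

// Vendor IDs as listed in the change description.
constexpr uint32_t VK_VENDOR_ID_ARM      = 0x13B5;
constexpr uint32_t VK_VENDOR_ID_QUALCOMM = 0x5143;

// Assumed interpretation of the 256 MB suballocation cap.
constexpr uint64_t MAX_SUBALLOCATION_BLOCK_SIZE = 256ull * 1024 * 1024;

// Hypothetical feature toggles; the real backend stores these differently.
struct DeviceFeatures {
    bool fp16                = true;  // general FP16 arithmetic
    bool coopmat             = true;  // cooperative matrix (fp16-matrix)
    bool integer_dot_product = true;  // integer dot product (int8-matrix)
    bool large_warptile      = true;  // 'l_warptile' (Large) shader paths
};

// Assumption: "mobile" here means ARM or Qualcomm.
constexpr bool is_mobile_vendor(uint32_t vendor_id) {
    return vendor_id == VK_VENDOR_ID_ARM || vendor_id == VK_VENDOR_ID_QUALCOMM;
}

// Returns a copy of `f` with the per-vendor workarounds applied.
inline DeviceFeatures apply_mobile_workarounds(uint32_t vendor_id, DeviceFeatures f) {
    if (is_mobile_vendor(vendor_id)) {
        f.large_warptile = false;  // avoid register pressure / driver crashes
        f.coopmat        = false;  // cooperative matrix crashes on mobile GPUs
    }
    if (vendor_id == VK_VENDOR_ID_QUALCOMM) {
        f.integer_dot_product = false;  // driver issues on Qualcomm
    }
    // f.fp16 is deliberately left untouched for large model support.
    return f;
}

// Caps a requested suballocation block size at 256 MB.
inline uint64_t clamp_suballocation_block_size(uint64_t requested_bytes) {
    assert(requested_bytes > 0 && "block size must be nonzero");
    return requested_bytes < MAX_SUBALLOCATION_BLOCK_SIZE
               ? requested_bytes
               : MAX_SUBALLOCATION_BLOCK_SIZE;
}
```

In a real backend the vendor ID would come from the `vendorID` field of `VkPhysicalDeviceProperties`; here it is passed in directly so the sketch compiles without the Vulkan headers.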