Fooocus/ldm_patched
Hasham Vakani ⚡ 15547a8720 fix: Add NVIDIA Blackwell (RTX 50xx, sm_120) support
- Use bfloat16 dtype for UNet on Blackwell GPUs (compute major >= 12)
  which have native bf16 tensor core support
- Skip manual_cast for bfloat16 weights to avoid unnecessary casting
- Fix numpy TypeError with bfloat16 tensors in patch.py and
  ip_adapter.py by converting to float32 before .numpy() calls

Tested on RTX 5070 (sm_120, CUDA 12.8) with PyTorch nightly (cu128).
Generates images at ~3.2 it/s including Image Prompt (IP-Adapter) mode.

Fixes #3862, #4123, #4141
2026-03-04 05:43:44 +05:00
..
contrib feat: add support for playground v2.5 (#3073) 2024-06-04 20:15:49 +02:00
controlnet 2.1.826 2023-12-12 11:38:05 -08:00
k_diffusion feat: add restart sampler (#3219) 2024-07-01 14:24:21 +02:00
ldm Use weights_only for loading (#3427) 2024-08-03 12:33:01 +02:00
licenses-3rd backend 2024-01-27 05:12:34 -08:00
modules fix: Add NVIDIA Blackwell (RTX 50xx, sm_120) support 2026-03-04 05:43:44 +05:00
pfn Use weights_only for loading (#3427) 2024-08-03 12:33:01 +02:00
t2ia 2.1.826 2023-12-12 11:38:05 -08:00
taesd (requested) support AMD 8GB GPUs via Windows DirectML 2023-12-30 06:30:59 -08:00
unipc 2.1.826 2023-12-12 11:38:05 -08:00
utils backend 2024-01-27 05:12:34 -08:00