Skip to content

Commit e10a9f5

Browse files
committed
fix bnb + enable_model_cpu_offload results
1 parent 2ecd21b commit e10a9f5

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

diffusers-quantization.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -455,8 +455,8 @@ pipe = FluxPipeline.from_pretrained(
455455
**bnb + `enable_model_cpu_offload`**:
456456
| Precision | Memory after loading | Peak memory | Inference time |
457457
|---------------|----------------------|-------------|----------------|
458-
| 4-bit | 12.584 GB | 17.281 GB | 12 seconds |
459-
| 8-bit | 19.273 GB | 24.432 GB | 27 seconds |
458+
| 4-bit | 12.383 GB | 12.383 GB | 17 seconds |
459+
| 8-bit | 19.182 GB | 23.428 GB | 27 seconds |
460460

461461
<details>
462462
<summary>Example (Flux-dev with fp8 layerwise casting + group offloading):</summary>

0 commit comments

Comments
 (0)