[refactor] condense group offloading #11990

Open · wants to merge 6 commits into main
Conversation

@a-r-r-o-w (Member) commented Jul 25, 2025

After many of the recent updates, the current implementation is hard to understand, debug, or reason through. There is also one code path that appears to be completely unused (see the removed self.offload_to_disk_path-related changes and the call into self._onload_from_disk).

This PR tries to refactor and clean up some of the implementation so that implementing new changes is easier in the future.

Related to a comment that I'm trying to debug through.
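
For context, group offloading in diffusers is enabled through a model-level helper. A minimal usage sketch, assuming the public enable_group_offload API; the checkpoint and parameter values below are illustrative, not taken from this PR:

```python
import torch
from diffusers import FluxTransformer2DModel

# Illustrative checkpoint; any diffusers model that supports group offloading works.
model = FluxTransformer2DModel.from_pretrained(
    "black-forest-labs/FLUX.1-dev", subfolder="transformer", torch_dtype=torch.bfloat16
)

# Keep parameter groups on the CPU and move each group onto the GPU only when
# it is about to run. use_stream overlaps host-to-device copies with compute.
model.enable_group_offload(
    onload_device=torch.device("cuda"),
    offload_device=torch.device("cpu"),
    offload_type="leaf_level",
    use_stream=True,
)
```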

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@sayakpaul (Member) left a comment

Looks super clean!

I just ran some benchmarks to confirm that the changes don't have any detrimental effect on the speed-memory trade-off (code), and the results look alright to me.

It would be good to also run all the group offloading tests on the GPU.
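
The benchmark script itself is only linked above; a minimal sketch of how such a speed-memory check might be measured (this helper is hypothetical, not the linked code):

```python
import time
import torch

def benchmark(model, inputs, warmup=2, runs=5):
    # Warm up so CUDA kernels and the offloading hooks are initialized.
    for _ in range(warmup):
        model(**inputs)
    torch.cuda.synchronize()
    torch.cuda.reset_peak_memory_stats()

    start = time.perf_counter()
    for _ in range(runs):
        model(**inputs)
    torch.cuda.synchronize()

    avg_latency = (time.perf_counter() - start) / runs
    peak_mem_gb = torch.cuda.max_memory_allocated() / 1024**3
    return avg_latency, peak_mem_gb
```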

```python
finally:
    pinned_dict = None

def _transfer_tensor_to_device(self, tensor, source_tensor, current_stream=None):
```
Very clean!
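
The body of _transfer_tensor_to_device is not shown in this excerpt. A hypothetical sketch of the usual pattern such a helper follows (non-blocking copy plus stream bookkeeping; field names like self.onload_device and self.non_blocking are assumptions, not the PR's implementation):

```python
def _transfer_tensor_to_device(self, tensor, source_tensor, current_stream=None):
    # Hypothetical sketch: move the source data onto the onload device,
    # overlapping the copy with compute when non_blocking is enabled.
    tensor.data = source_tensor.to(self.onload_device, non_blocking=self.non_blocking)
    if current_stream is not None:
        # Prevent the caching allocator from reusing this memory until the
        # work queued on current_stream has finished.
        tensor.data.record_stream(current_stream)
```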


```python
if self.offload_to_disk_path:
```

Perfect!

```python
with context:
    # Load to CPU (if using streams) or directly to target device, pin, and async copy to device
    device = self.onload_device if self.stream is None else "cpu"
    loaded_tensors = safetensors.torch.load_file(self.safetensors_file_path, device=device)
```

When torch.device("cuda") is supplied as the onload_device argument, the call fails here with safetensors complaining about an invalid device cuda. Simply wrapping device in str() solves the issue.
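
A minimal sketch of the suggested fix, applied to the snippet above (same identifiers as in the diff):

```python
with context:
    # Load to CPU (if using streams) or directly to the target device.
    device = self.onload_device if self.stream is None else "cpu"
    # safetensors expects a string device identifier, so a torch.device
    # value must be stringified before being passed through.
    loaded_tensors = safetensors.torch.load_file(
        self.safetensors_file_path, device=str(device)
    )
```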

@DN6 (Collaborator) left a comment

Nice 👍🏽
