
update the logic of is_sequential_cpu_offload #7788

Merged 4 commits into main on May 1, 2024
Conversation

yiyixuxu (Collaborator)

Follow-up to huggingface/accelerate#2701.

When sequential CPU offloading is enabled for the pipeline, accelerate installs an AlignDevicesHook on each model component; if the model contains a buffer, it instead installs a SequentialHook wrapping two AlignDevicesHooks.

Currently, we assume the model is sequentially offloaded only when the hook is an AlignDevicesHook. In this PR, I updated the logic to also cover the scenario where a SequentialHook is created.
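The updated check described above could be sketched as follows. This is a minimal, self-contained approximation: `AlignDevicesHook` and `SequentialHook` here are stand-ins for the classes in `accelerate.hooks`, and the helper name `is_sequential_cpu_offload` is illustrative.

```python
# Stand-ins for accelerate.hooks.AlignDevicesHook / SequentialHook so the
# sketch runs without accelerate installed.
class AlignDevicesHook:
    pass

class SequentialHook:
    def __init__(self, *hooks):
        self.hooks = list(hooks)

def is_sequential_cpu_offload(module):
    # A module counts as sequentially offloaded when accelerate attached
    # either a bare AlignDevicesHook, or (for modules with buffers) a
    # SequentialHook whose sub-hooks are all AlignDevicesHooks.
    hook = getattr(module, "_hf_hook", None)
    if isinstance(hook, AlignDevicesHook):
        return True
    if isinstance(hook, SequentialHook):
        return all(isinstance(h, AlignDevicesHook) for h in hook.hooks)
    return False

class DummyModule:
    pass

with_buffer = DummyModule()
with_buffer._hf_hook = SequentialHook(AlignDevicesHook(), AlignDevicesHook())
without_hook = DummyModule()

print(is_sequential_cpu_offload(with_buffer))   # True
print(is_sequential_cpu_offload(without_hook))  # False
```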

@@ -1373,6 +1373,7 @@ def test_sequential_cpu_offload_forward_pass(self, expected_max_diff=1e-4):
output_without_offload = pipe(**inputs)[0]

pipe.enable_sequential_cpu_offload()
assert pipe._execution_device.type == pipe._offload_device.type
Collaborator Author

Add a check to make sure the `_execution_device` attribute of the pipeline works as expected, since we rely on the accelerate hooks to infer the device.

def _execution_device(self):
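For context, the idea of inferring the execution device from accelerate hooks might be sketched like this. Everything here is an illustrative simplification, not the actual diffusers implementation: `PipelineSketch`, the stand-in `AlignDevicesHook`, and the use of plain strings for devices are all assumptions.

```python
class AlignDevicesHook:  # stand-in for accelerate.hooks.AlignDevicesHook
    def __init__(self, execution_device=None):
        self.execution_device = execution_device

class PipelineSketch:
    """Illustrative pipeline holding named model components."""

    def __init__(self, **components):
        self.components = components
        self._offload_device = "cpu"

    @property
    def _execution_device(self):
        # Walk the components: the first accelerate hook carrying a concrete
        # execution_device tells us where compute will actually run, even
        # though the offloaded weights currently live on the CPU.
        for component in self.components.values():
            hook = getattr(component, "_hf_hook", None)
            if hook is not None and getattr(hook, "execution_device", None) is not None:
                return hook.execution_device
        return self._offload_device

class DummyModule:
    pass

unet = DummyModule()
unet._hf_hook = AlignDevicesHook(execution_device="cuda:0")
pipe = PipelineSketch(unet=unet)
print(pipe._execution_device)  # cuda:0
```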

Member

Does it also make sense to check if the SequentialHook is properly installed when a model in a pipeline has buffers?

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Comment on lines 1510 to 1514
if hasattr(v, "_hf_hook"):
if isinstance(v._hf_hook, accelerate.hooks.SequentialHook):
for hook in v._hf_hook.hooks:
if not isinstance(hook, accelerate.hooks.AlignDevicesHook):
offloaded_modules_with_incorrect_hooks[k] = type(v._hf_hook.hooks[0])
Member

Could you explain this part of the check a bit?

sayakpaul (Member) left a comment

Thank you! Left some comments.

SunMarc (Member) left a comment

LGTM! Thanks for updating the logic. +1 on @sayakpaul's comments.


inputs = self.get_dummy_inputs(generator_device)
output_with_offload = pipe(**inputs)[0]

max_diff = np.abs(to_np(output_with_offload) - to_np(output_without_offload)).max()
self.assertLess(max_diff, expected_max_diff, "CPU offloading should not affect the inference results")

# make sure all `torch.nn.Module` components (except those in `self._exclude_from_cpu_offload`) are offloaded correctly
Collaborator Author

cc @DN6
Are these offload tests being run? E.g. I randomly checked DiT, and all of its offloading tests are failing, but I did not see that in the reports (I looked in the slow tests too and did not see any).

Member

Could it be because DiT has low usage, and hence it's not picked up by `utils/fetch_torch_cuda_pipeline_test_matrix.py` and ultimately not included here?

tests/pipelines/${{ matrix.module }}

Collaborator

Yeah, DiT has low usage so those slow tests are skipped. They should be in the nightly tests though.

@yiyixuxu yiyixuxu merged commit 21a7ff1 into main May 1, 2024
17 checks passed
yiyixuxu (Collaborator Author) commented May 1, 2024

a follow-up (low-priority) item is to see if we can include the models we have been excluding from offloading

@yiyixuxu yiyixuxu deleted the offload-cleanup branch May 1, 2024 16:27
XSE42 added a commit to XSE42/diffusers3d that referenced this pull request May 13, 2024
diffusers commit 21a7ff1
    update the logic of `is_sequential_cpu_offload` (huggingface/diffusers#7788)
keepdying commented Jul 24, 2024

I think instead of just checking for AlignDevicesHook, we should check that its offload attribute is True to determine whether the module is offloaded to CPU. In some cases the check fails even though it shouldn't, e.g. when a pipeline is initialized with its `__init__` method and the required components are loaded with `.from_pretrained(model_path, device_map={"": 0})`.
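The suggestion could be sketched like this. The hook class is again a stand-in for `accelerate.hooks.AlignDevicesHook` (which does accept an `offload` argument), and the helper name `is_offloaded_to_cpu` is hypothetical.

```python
class AlignDevicesHook:  # stand-in for accelerate.hooks.AlignDevicesHook
    def __init__(self, offload=False):
        self.offload = offload

def is_offloaded_to_cpu(module):
    # Require the hook's `offload` flag in addition to its type, so a
    # component placed directly on a device via device_map (which still
    # gets an AlignDevicesHook) is not misclassified as offloaded.
    hook = getattr(module, "_hf_hook", None)
    return isinstance(hook, AlignDevicesHook) and hook.offload

class DummyModule:
    pass

on_gpu = DummyModule()
on_gpu._hf_hook = AlignDevicesHook(offload=False)  # e.g. device_map={"": 0}
offloaded = DummyModule()
offloaded._hf_hook = AlignDevicesHook(offload=True)

print(is_offloaded_to_cpu(on_gpu), is_offloaded_to_cpu(offloaded))  # False True
```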

SunMarc (Member) commented Jul 24, 2024

Hey @keepdying, could you share a minimal reproducer of the error you are facing in a separate issue? We can definitely switch to checking the offload attribute.
