[refactor] make set_attention_slice recursive #1532
Conversation
The documentation is not available anymore as the PR was closed or merged.
    set_requires_grad(self.text_encoder, False)
    set_requires_grad(self.clip_model, False)

    def enable_attention_slicing(self, slice_size: Optional[Union[str, int]] = "auto"):
Let's not change community pipelines. This will remove functionality at the moment.
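For context, a minimal sketch of what such a base-class method on `DiffusionPipeline` could look like. The attribute-walking discovery loop below is our assumption for illustration, not necessarily the PR's exact code:

```python
from typing import Optional, Union

import torch


class DiffusionPipeline:
    # Minimal sketch; the real class also carries loading/saving logic.

    def enable_attention_slicing(self, slice_size: Optional[Union[str, int]] = "auto"):
        # Delegate to every component that knows how to slice its attention.
        self.set_attention_slice(slice_size)

    def disable_attention_slicing(self):
        # Passing `None` restores the full (unsliced) attention computation.
        self.enable_attention_slicing(None)

    def set_attention_slice(self, slice_size: Optional[Union[str, int]]):
        # Assumed discovery strategy: walk the pipeline's attributes and
        # forward the call to each sub-model (e.g. the UNet) that implements
        # `set_attention_slice`; components without attention are skipped.
        for module in self.__dict__.values():
            if isinstance(module, torch.nn.Module) and hasattr(module, "set_attention_slice"):
                module.set_attention_slice(slice_size)
```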
src/diffusers/models/attention.py (outdated diff)

    )

    -def _set_attention_slice(self, slice_size):
    +def set_attention_slice(self, slice_size):
Nice! Works for me.
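For readers following along, the recursive application boils down to a traversal like the sketch below. It assumes that, after this refactor, only the top-level model and leaf attention blocks still expose `set_attention_slice`; the function name is illustrative:

```python
import torch


def fn_recursive_set_attention_slice(module: torch.nn.Module, slice_size: int):
    # Modules that support slicing expose `set_attention_slice`; the
    # traversal configures them and keeps descending so nested attention
    # blocks are reached without any per-block boilerplate.
    if hasattr(module, "set_attention_slice"):
        module.set_attention_slice(slice_size)
    for child in module.children():
        fn_recursive_set_attention_slice(child, slice_size)
```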
patrickvonplaten left a comment:
Thanks for doing this PR! Let's just remove the changes from the community pipelines.
pcuenca left a comment:
Very nice! Same comment as Patrick: this would currently break community pipelines, as they are loaded from main.
patil-suraj left a comment:
Looks good to me! Just left one comment.
    if slice_size == "auto":
        # half the attention head size is usually a good trade-off between
        # speed and memory
        slice_size = [dim // 2 for dim in sliceable_head_dims]
    elif slice_size == "max":
        # make smallest slice possible
        slice_size = num_slicable_layers * [1]
As discussed offline, this logic could be moved to the attention block itself. That way it would be easy to support this functionality in new models which use the attention blocks.
That makes sense. As discussed offline, I'll leave it as is for now, though, so that we can test that the number of passed slices equals the number of sliceable layers. Otherwise we cannot really test it.
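As a concrete illustration of that check, here is a minimal, self-contained sketch of how a scalar or keyword `slice_size` gets resolved into a per-layer list and validated. The function and variable names are ours, not necessarily the PR's:

```python
from typing import List, Union


def resolve_slice_size(
    slice_size: Union[str, int, List[int]], sliceable_head_dims: List[int]
) -> List[int]:
    """Turn "auto"/"max"/int into one slice size per sliceable layer."""
    num_sliceable_layers = len(sliceable_head_dims)

    if slice_size == "auto":
        # half the attention head size is usually a good trade-off
        # between speed and memory
        slice_size = [dim // 2 for dim in sliceable_head_dims]
    elif slice_size == "max":
        # make the smallest slice possible
        slice_size = num_sliceable_layers * [1]

    # broadcast a single integer to every sliceable layer
    if not isinstance(slice_size, list):
        slice_size = num_sliceable_layers * [slice_size]

    # the check the tests rely on: one slice size per sliceable layer
    if len(slice_size) != num_sliceable_layers:
        raise ValueError(
            f"You have provided {len(slice_size)} slices, but the model has "
            f"{num_sliceable_layers} sliceable layers."
        )
    return slice_size


# e.g. with head dims (8, 16): "auto" -> [4, 8], "max" -> [1, 1], 2 -> [2, 2]
assert resolve_slice_size("auto", [8, 16]) == [4, 8]
assert resolve_slice_size("max", [8, 16]) == [1, 1]
assert resolve_slice_size(2, [8, 16]) == [2, 2]
```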
    if slice_size == "auto":
        # half the attention head size is usually a good trade-off between
        # speed and memory
        slice_size = [dim // 2 for dim in sliceable_head_dims]
    elif slice_size == "max":
        # make smallest slice possible
        slice_size = num_slicable_layers * [1]

    slice_size = num_slicable_layers * [slice_size] if not isinstance(slice_size, list) else slice_size
Same comment as above.
    def test_model_attention_slicing(self):
        init_dict, inputs_dict = self.prepare_init_args_and_inputs_for_common()

        init_dict["attention_head_dim"] = (8, 16)

        model = self.model_class(**init_dict)
        model.to(torch_device)
        model.eval()

        model.set_attention_slice("auto")
        with torch.no_grad():
            output = model(**inputs_dict)
        assert output is not None

        model.set_attention_slice("max")
        with torch.no_grad():
            output = model(**inputs_dict)
        assert output is not None

        model.set_attention_slice(2)
        with torch.no_grad():
            output = model(**inputs_dict)
        assert output is not None

    def test_model_slicable_head_dim(self):
        init_dict, inputs_dict = self.prepare_init_args_and_inputs_for_common()

        init_dict["attention_head_dim"] = (8, 16)

        model = self.model_class(**init_dict)

        def check_slicable_dim_attr(module: torch.nn.Module):
            if hasattr(module, "set_attention_slice"):
                assert isinstance(module.sliceable_head_dim, int)

            for child in module.children():
                check_slicable_dim_attr(child)
Thanks a lot for adding the tests!
Commits:

* make attn slice recursive
* remove set_attention_slice from blocks
* fix copies
* make enable_attention_slicing base class method of DiffusionPipeline
* fix set_attention_slice
* fix set_attention_slice
* fix copies
* add tests
* up
* up
* up
* update
* up
* uP

Co-authored-by: Patrick von Platen <[email protected]>
Inspired by #1493, this PR makes the `set_attention_slice` method recursive to be able to easily apply it to various attention blocks without much boilerplate.

* Makes `set_attention_slice` a method of `DiffusionPipeline`, so it can be applied to all pipelines, since all of them have attention.
* Keeps the `auto` slice logic inside `UNet2DConditionModel` and recurses there to apply the slice to all blocks.

The logic differs a bit from #1493 in that we keep the `set_attention_slice` method on main model classes like `UNet2DConditionModel` and move the recursive application logic there. This is so that we can control this functionality using individual model classes. A usage sketch follows below.
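Put together, usage would look roughly like this. A hypothetical sketch: the checkpoint id is just an example, and the exact slice values shown are illustrative:

```python
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained("runwayml/stable-diffusion-v1-5")

# Pipeline level: DiffusionPipeline forwards to every component that
# implements set_attention_slice, so every pipeline gets the feature.
pipe.enable_attention_slicing()       # defaults to "auto"
pipe.enable_attention_slicing("max")  # smallest slices, lowest memory
pipe.disable_attention_slicing()      # back to full attention

# Model level: UNet2DConditionModel owns the "auto"/"max" resolution and
# recursively configures all of its attention blocks.
pipe.unet.set_attention_slice("auto")  # half of each sliceable head dim
pipe.unet.set_attention_slice(2)       # broadcast one fixed slice size
```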