Optimise Attention Mechanisms #145
Conversation
Signed-off-by: Walter Hugo Lopez Pinaya <[email protected]>
Waiting for version 0.0.16 from xformers
Adopting Linear layers is more efficient than the 1x1 convs.
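A minimal sketch of the idea (names and structure are illustrative, not taken from the PR code): the q/k/v and output projections of a spatial self-attention block can be written as `nn.Linear` layers over the flattened `(B, H*W, C)` sequence instead of `nn.Conv2d(channels, channels, kernel_size=1)` over `(B, C, H, W)`.

```python
import torch
import torch.nn as nn


class LinearSelfAttention(nn.Module):
    """Hypothetical attention block using Linear projections instead of 1x1 convs."""

    def __init__(self, channels: int) -> None:
        super().__init__()
        # Each nn.Linear here replaces an nn.Conv2d(channels, channels, kernel_size=1)
        self.to_q = nn.Linear(channels, channels)
        self.to_k = nn.Linear(channels, channels)
        self.to_v = nn.Linear(channels, channels)
        self.proj = nn.Linear(channels, channels)
        self.scale = channels ** -0.5

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, h, w = x.shape
        seq = x.flatten(2).transpose(1, 2)                # (B, H*W, C)
        q, k, v = self.to_q(seq), self.to_k(seq), self.to_v(seq)
        attn = torch.softmax(q @ k.transpose(-2, -1) * self.scale, dim=-1)
        out = self.proj(attn @ v)                         # (B, H*W, C)
        return out.transpose(1, 2).reshape(b, c, h, w)
```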
Signed-off-by: Walter Hugo Lopez Pinaya <[email protected]>
On my TITAN RTX, the new attention layers make the 2D DDPM tutorial consume 20 GB of memory, with 33 s per training epoch and 15 s to sample 1 image. When using xformers, it consumes a little less memory (18 GB), with 38 s per training epoch and 10 s to sample 1 image. When I tested the AutoencoderKL, there was no significant difference between with and without xformers. I will still try it on an A100.
Signed-off-by: Walter Hugo Lopez Pinaya <[email protected]>
On an A100, the 2D DDPM tutorial takes 15-16 s per training epoch, 19 GB of memory, and 8 s to sample 1 image. With xformers, it takes 18-19 s per training epoch, 16.4 GB of memory, and 8 s to generate 1 sample.
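For reference, a minimal sketch of the kind of toggle being benchmarked above: fall back to plain scaled dot-product attention when xformers is not installed, and use `xformers.ops.memory_efficient_attention` when it is. The function and module names besides the xformers call are assumptions, not the PR's actual code.

```python
import torch

try:
    import xformers.ops as xops
    HAS_XFORMERS = True
except ImportError:
    HAS_XFORMERS = False


def attention(q: torch.Tensor, k: torch.Tensor, v: torch.Tensor) -> torch.Tensor:
    """q, k, v: (batch, seq_len, dim). Returns (batch, seq_len, dim)."""
    if HAS_XFORMERS:
        # Memory-efficient attention from xformers: lower peak memory,
        # with speed that depends on the GPU, as the benchmarks above show.
        return xops.memory_efficient_attention(q, k, v)
    scale = q.shape[-1] ** -0.5
    weights = torch.softmax(q @ k.transpose(-2, -1) * scale, dim=-1)
    return weights @ v
```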
…attention-mechanisms # Conflicts: # generative/networks/nets/diffusion_model_unet.py
Signed-off-by: Walter Hugo Lopez Pinaya <[email protected]>
danieltudosiu left a comment
The pull request is good, but we should try not to create as much duplicate code as in this PR. If time allows, please try to create an attention utils module, or something similar, and aggregate the reusable methods there.
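One possible shape for such a refactor, purely as a sketch (the module path and class name are hypothetical): a single shared attention helper that both the diffusion UNet and the autoencoder import, instead of each file carrying its own copy of the attention math.

```python
# generative/networks/nets/attention_utils.py  (hypothetical module, not from this PR)
import torch
import torch.nn as nn


class QKVAttention(nn.Module):
    """Reusable scaled dot-product attention over (B, seq_len, dim) tensors."""

    def __init__(self, dim: int, use_xformers: bool = False) -> None:
        super().__init__()
        self.scale = dim ** -0.5
        self.use_xformers = use_xformers

    def forward(self, q: torch.Tensor, k: torch.Tensor, v: torch.Tensor) -> torch.Tensor:
        if self.use_xformers:
            import xformers.ops as xops
            return xops.memory_efficient_attention(q, k, v)
        weights = torch.softmax(q @ k.transpose(-2, -1) * self.scale, dim=-1)
        return weights @ v


# diffusion_model_unet.py and the autoencoder could then both do:
#   from generative.networks.nets.attention_utils import QKVAttention
# rather than defining the same attention logic twice.
```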
…attention-mechanisms
Signed-off-by: Walter Hugo Lopez Pinaya <[email protected]>