Fix EMA for multi-gpu training in the unconditional example #1930

anton-l · 2023-01-05T15:59:35Z

Now the unconditional EMA wrapper mimics the EMAModel from train_text_to_image.py. This isn't a full copy because the unconditional example uses a different decay schedule.

Fixes broken training on multiple GPUs: #1772 #1895

anton-l · 2023-01-05T16:01:24Z

examples/unconditional_image_generation/train_unconditional.py

        action="store_true",
-        default=True,
        help="Whether to use Exponential Moving Average for the final model weights.",
    )


Also removing the wrong default, as it was done for the text2img example (issue #1654)

small breaking change but think that's fine!

HuggingFaceDocBuilderDev · 2023-01-05T16:05:06Z

The documentation is not available anymore as the PR was closed or merged.

src/diffusers/training_utils.py

…fix-unconditional-ema

src/diffusers/training_utils.py

examples/unconditional_image_generation/train_unconditional.py

src/diffusers/training_utils.py

Co-authored-by: Pedro Cuenca <[email protected]>

src/diffusers/training_utils.py

patrickvonplaten

The PR looks very nice to me! I'd also just try to make it more or less 100% backwards compatible (let's try to be role-models when it comes to that in OSS).

Think it's not too hard:

detect if a model is passed
probs have to keep a mapping of parameters to names to be able to use new logic but also return model as done previously
we can relatively quickly deprecate here then I think

patil-suraj · 2023-01-18T13:39:41Z

The EMAModel should now be 100% backwards compatible. @patrickvonplaten @pcuenca would be nice if you could take one more look.

patil-suraj · 2023-01-18T14:36:00Z

The failing tests are unrelated.

pcuenca

Looks great! Just pointed out a couple nits.

src/diffusers/training_utils.py

Co-authored-by: Pedro Cuenca <[email protected]>

patrickvonplaten · 2023-01-18T17:08:09Z

Good to merge for me - thanks @patil-suraj !

…ace#1930) * improve EMA * style * one EMA model * quality * fix tests * fix test * Apply suggestions from code review Co-authored-by: Pedro Cuenca <[email protected]> * re organise the unconditional script * backwards compatibility * default to init values for some args * fix ort script * issubclass => isinstance * update state_dict * docstr * doc * Apply suggestions from code review Co-authored-by: Pedro Cuenca <[email protected]> * use .to if device is passed * deprecate device * make flake happy * fix typo Co-authored-by: patil-suraj <[email protected]> Co-authored-by: Pedro Cuenca <[email protected]> Co-authored-by: Patrick von Platen <[email protected]>

anton-l added 2 commits January 5, 2023 16:51

improve EMA

a4c9118

style

4e32811

anton-l requested review from patil-suraj, patrickvonplaten and pcuenca January 5, 2023 15:59

anton-l commented Jan 5, 2023

View reviewed changes

patrickvonplaten reviewed Jan 5, 2023

View reviewed changes

src/diffusers/training_utils.py Outdated Show resolved Hide resolved

patil-suraj self-assigned this Jan 17, 2023

patil-suraj added 3 commits January 17, 2023 11:44

Merge branch 'main' of https://github.com/huggingface/diffusers into …

4f55f59

…fix-unconditional-ema

one EMA model

2bb2d38

quality

3b27481

patil-suraj reviewed Jan 17, 2023

View reviewed changes

src/diffusers/training_utils.py Show resolved Hide resolved

src/diffusers/training_utils.py Show resolved Hide resolved

src/diffusers/training_utils.py Show resolved Hide resolved

src/diffusers/training_utils.py Show resolved Hide resolved

patil-suraj requested review from patil-suraj and patrickvonplaten January 17, 2023 13:05

fix tests

e9db8cd

anton-l commented Jan 17, 2023

View reviewed changes

src/diffusers/training_utils.py Show resolved Hide resolved

patil-suraj reviewed Jan 17, 2023

View reviewed changes

src/diffusers/training_utils.py Outdated Show resolved Hide resolved

fix test

43f0fe3

pcuenca reviewed Jan 17, 2023

View reviewed changes

patil-suraj and others added 2 commits January 17, 2023 15:45

Apply suggestions from code review

db2d359

Co-authored-by: Pedro Cuenca <[email protected]>

re organise the unconditional script

e7ac781

patrickvonplaten reviewed Jan 17, 2023

View reviewed changes

src/diffusers/training_utils.py Outdated Show resolved Hide resolved

patrickvonplaten reviewed Jan 17, 2023

View reviewed changes

patil-suraj added 6 commits January 18, 2023 14:17

backwards compatibility

7092f3a

default to init values for some args

f2b5d40

fix ort script

eeed3af

issubclass => isinstance

7f84f1b

update state_dict

be02125

docstr

f61474a

patil-suraj added 2 commits January 18, 2023 14:41

Merge branch 'main' into fix-unconditional-ema

2eb8a0c

doc

df2d9e0

pcuenca approved these changes Jan 18, 2023

View reviewed changes

src/diffusers/training_utils.py Show resolved Hide resolved

src/diffusers/training_utils.py Outdated Show resolved Hide resolved

src/diffusers/training_utils.py Outdated Show resolved Hide resolved

src/diffusers/training_utils.py Show resolved Hide resolved

patil-suraj and others added 4 commits January 18, 2023 17:10

Apply suggestions from code review

a94b53b

Co-authored-by: Pedro Cuenca <[email protected]>

use .to if device is passed

c4409e4

deprecate device

bd9f142

make flake happy

0bafc11

patrickvonplaten and others added 2 commits January 19, 2023 09:43

Merge branch 'main' into fix-unconditional-ema

e65e35e

fix typo

dc1935f

patil-suraj merged commit 7c82a16 into main Jan 19, 2023

patil-suraj deleted the fix-unconditional-ema branch January 19, 2023 10:55

anton-l mentioned this pull request Jan 30, 2023

Unconditional Image Generation generating noise #1772

Closed

Fix EMA for multi-gpu training in the unconditional example #1930

Fix EMA for multi-gpu training in the unconditional example #1930

Uh oh!

Conversation

anton-l commented Jan 5, 2023

Uh oh!

anton-l Jan 5, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

patrickvonplaten Jan 5, 2023

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented Jan 5, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

patrickvonplaten left a comment

Choose a reason for hiding this comment

Uh oh!

patil-suraj commented Jan 18, 2023

Uh oh!

patil-suraj commented Jan 18, 2023

Uh oh!

pcuenca left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

patrickvonplaten commented Jan 18, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

anton-l Jan 5, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Jan 5, 2023 •

edited

Loading