@patil-suraj patil-suraj commented Sep 30, 2022

Fix gradient clipping when doing gradient accumulation. The gradients need to be clipped after they are synchronized, i.e. only on the step where the accumulated gradients are actually applied. cf. huggingface/accelerate#641 (comment)
fixes #668

Although I'm not sure why it was only triggered in half-precision.
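
For readers following along, here is a minimal, dependency-free sketch of the pattern described above: accumulate gradients over several micro-batches and clip only on the step where they are synchronized and applied. This is not the code from this PR. `clip_by_global_norm`, `train`, and the local `sync_gradients` flag are hypothetical names used for illustration, and the toy "model" is just a list of floats. In Accelerate, the corresponding check is, to my understanding, `accelerator.sync_gradients`, with clipping done through `accelerator.clip_grad_norm_`.

```python
import math


def clip_by_global_norm(grads, max_norm):
    """Scale grads in place so their L2 norm is at most max_norm.

    Returns the norm measured before clipping.
    """
    total_norm = math.sqrt(sum(g * g for g in grads))
    if total_norm > max_norm:
        scale = max_norm / total_norm
        for i in range(len(grads)):
            grads[i] *= scale
    return total_norm


def train(micro_batch_grads, accumulation_steps, max_norm, lr=1.0):
    """Toy loop over precomputed per-micro-batch gradients (hypothetical helper)."""
    n = len(micro_batch_grads[0])
    weights = [0.0] * n
    accumulated = [0.0] * n
    clip_calls = 0

    for step, grads in enumerate(micro_batch_grads, start=1):
        # Add this micro-batch's contribution, averaged over the window.
        for i, g in enumerate(grads):
            accumulated[i] += g / accumulation_steps

        # Stand-in for a "gradients are synchronized" flag: true only on
        # the last micro-batch of each accumulation window.
        sync_gradients = step % accumulation_steps == 0

        if sync_gradients:
            # Clip once, on the fully accumulated gradient, then update.
            clip_by_global_norm(accumulated, max_norm)
            clip_calls += 1
            for i in range(n):
                weights[i] -= lr * accumulated[i]
            accumulated = [0.0] * n

    return weights, clip_calls


if __name__ == "__main__":
    grads = [[3.0, 4.0], [3.0, 4.0], [0.0, 0.0], [0.0, 0.0]]
    print(train(grads, accumulation_steps=2, max_norm=1.0))
```

With four micro-batches and `accumulation_steps=2`, clipping runs twice rather than four times, and each call sees the complete accumulated gradient instead of a partial one.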

HuggingFaceDocBuilderDev commented Sep 30, 2022

The documentation is not available anymore as the PR was closed or merged.

@Maki9009

So this has been fixed? I'm confused lol. When it was set to "no" I didn't have enough CUDA memory, and when I changed it to fp16 I got a different error, but I sadly don't remember it atm. Also, I would love to be able to run this on something other than Colab, like Vast.ai or RunPod, since Google doesn't provide Colab Pro worldwide. I tried to use it on Vast and was hit with an error about not being able to install files.

@patil-suraj patil-suraj merged commit 14f4af8 into main Oct 3, 2022
@patil-suraj patil-suraj deleted the fix-dreambooth-fp16 branch October 3, 2022 08:54
prathikr pushed a commit to prathikr/diffusers that referenced this pull request Oct 26, 2022

Development

Successfully merging this pull request may close these issues.

[dreambooth] RuntimeError when trying to Train on a Tesla T4 on Colab.