
Conversation

@duongna21
Contributor

What does this PR do?

  • Add a Flax example for fine-tuning Stable Diffusion.
  • Runs well on a Tesla A100 (40 GB) with a maximum batch size of 3.
  • EMA is not added because my one-line tree_map implementation of EMA made training much slower and did not show a visible improvement in the results (a sketch of that update follows this list).
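For reference, a minimal sketch of the tree_map-based EMA update mentioned above (names and the decay value are illustrative, not taken from this PR):

import jax

def ema_update(ema_params, params, decay=0.9999):
    # Exponential moving average over two matching pytrees of parameters.
    return jax.tree_util.tree_map(
        lambda e, p: decay * e + (1.0 - decay) * p, ema_params, params
    )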

How to run (76% faster than the PyTorch example with the same args on a Tesla A100)

export MODEL_NAME="duongna/stable-diffusion-v1-4-flax"
export dataset_name="lambdalabs/pokemon-blip-captions"

python train_text_to_image_flax.py \
  --pretrained_model_name_or_path=$MODEL_NAME \
  --dataset_name=$dataset_name \
  --resolution=512 --center_crop --random_flip \
  --train_batch_size=1 \
  --max_train_steps=15000 \
  --learning_rate=1e-05 \
  --max_grad_norm=1 \
  --output_dir="sd-pokemon-model" 

Prompt: robotic cat with wings

[image: generated sample]

Who can review?

cc @patrickvonplaten @patil-suraj

@patil-suraj patil-suraj self-assigned this Oct 26, 2022
@patil-suraj
Contributor

Very cool PR @duongna21, amazing! Will take a look soon.

@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented Oct 26, 2022

The documentation is not available anymore as the PR was closed or merged.

Contributor

@patil-suraj patil-suraj left a comment


Looks very good, thanks a lot for adding this example. Left some comments about the dataloader and dtype.

Also, let's update the README with an example command to show how to run these examples.

@duongna21
Contributor Author

@patil-suraj Thanks for the very helpful comments. Addressed them!

Contributor

@patil-suraj patil-suraj left a comment


Thanks for addressing the comments, good for merge now!

@patil-suraj patil-suraj merged commit abe0582 into huggingface:main Oct 27, 2022
@duongna21 duongna21 deleted the add-finetune-sd-flax branch October 27, 2022 14:19
@entrpn
Contributor

entrpn commented Oct 27, 2022

@duongna21 Thank you for this contribution. I did have to make the following change to get it working on TPUs:

import jax
import jax.numpy as jnp
import torch

# Choose JAX dtypes when running on TPU and torch dtypes otherwise;
# args.mixed_precision comes from the training script's CLI flags.
device_type = jax.devices()[0].device_kind

weight_dtype = torch.float32
if "TPU" in device_type:
    weight_dtype = jnp.float32
    if args.mixed_precision == "fp16":
        weight_dtype = jnp.float16
    elif args.mixed_precision == "bf16":
        weight_dtype = jnp.bfloat16
else:
    if args.mixed_precision == "fp16":
        weight_dtype = torch.float16
    elif args.mixed_precision == "bf16":
        weight_dtype = torch.bfloat16
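As a hedged illustration (not part of this PR), a dtype selected this way could then be applied to a Flax params pytree on the TPU/JAX path; cast_params is a hypothetical helper:

import jax
import jax.numpy as jnp

def cast_params(params, dtype=jnp.bfloat16):
    # Cast every floating-point leaf of a params pytree to the chosen dtype.
    return jax.tree_util.tree_map(
        lambda p: p.astype(dtype) if jnp.issubdtype(p.dtype, jnp.floating) else p,
        params,
    )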

@duongna21
Contributor Author

@entrpn Thanks a lot! This will be fixed in #1038.
