[Community Pipeline] : First proposal of a multilingual stable diffusion … #1028

juancopi81 · 2022-10-27T15:13:36Z

[Community Pipeline] Based on issue #871 I thought it would be nice to have a pipeline with multilingual support and @patrickvonplaten said it was a good idea! :)

Followed the format of #897.

Code example (Also added in the PR):

from PIL import Image

import torch

from diffusers import DiffusionPipeline
from transformers import (
    pipeline,
    MBart50TokenizerFast,
    MBartForConditionalGeneration,
)
device = "cuda" if torch.cuda.is_available() else "cpu"
device_dict = {"cuda": 0, "cpu": -1}

# helper function taken from: https://huggingface.co/blog/stable_diffusion
def image_grid(imgs, rows, cols):
    assert len(imgs) == rows*cols

    w, h = imgs[0].size
    grid = Image.new('RGB', size=(cols*w, rows*h))
    grid_w, grid_h = grid.size

    for i, img in enumerate(imgs):
        grid.paste(img, box=(i%cols*w, i//cols*h))
    return grid

# Add language detection pipeline
language_detection_model_ckpt = "papluca/xlm-roberta-base-language-detection"
language_detection_pipeline = pipeline("text-classification",
                                       model=language_detection_model_ckpt,
                                       device=device_dict[device])

# Add model for language translation
trans_tokenizer = MBart50TokenizerFast.from_pretrained("facebook/mbart-large-50-many-to-one-mmt")
trans_model = MBartForConditionalGeneration.from_pretrained("facebook/mbart-large-50-many-to-one-mmt").to(device)

diffuser_pipeline = DiffusionPipeline.from_pretrained(
    "CompVis/stable-diffusion-v1-4",
    custom_pipeline="multilingual_stable_diffusion",
    language_detection_pipeline=language_detection_pipeline,
    translation_model=trans_model,
    translation_tokenizer=trans_tokenizer,
    revision="fp16",
    torch_dtype=torch.float16,
)

diffuser_pipeline.enable_attention_slicing()
diffuser_pipeline = diffuser_pipeline.to(device)

prompt = ["a photograph of an astronaut riding a horse", 
          "Una casa en la playa",
          "Ein Hund, der Orange isst",
          "Un restaurant parisien"]

output = diffuser_pipeline(prompt)

images = output.images

grid = image_grid(images, rows=2, cols=2)

This example produces the following image:

…pipeline

HuggingFaceDocBuilderDev · 2022-10-27T15:17:15Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

fix jnp dtype

* fix `upsample_nearest_nhwc` for large bsz * fix `upsample_nearest_nhwc` for large bsz

* improve tests * up * finish * upload * add init * up * finish vae * finish * reduce loading time with device_map * remove device_map from CPU * uP

* [Tests] Speed up slow tests * Up * up

* up * up * up * Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion.py * Apply suggestions from code review

* Update training and fine-tuning docs. * Update examples README. * Update README. * Add Flax fine-tuning section. * Accept suggestion Co-authored-by: Anton Lozhkov <[email protected]> * Accept suggestion Co-authored-by: Anton Lozhkov <[email protected]> Co-authored-by: Anton Lozhkov <[email protected]>

* add seed resizing to community examples * actually add the file responsible for seed resizing

…pipeline

juancopi81 · 2022-10-29T16:59:06Z

Sorry I had some mistake so I am closing this pull request and go back to fix some issues.

juancopi81 added 2 commits October 27, 2022 10:01

Community example: First proposal of a multilingual stable diffusion …

1508b51

…pipeline

Minor bug in readme file

cfada76

juancopi81 and others added 23 commits October 27, 2022 13:58

Fix some grammar errors

a1bfa54

Support grayscale images in numpy_to_pil (#1025)

fb38bb1

[Flax SD finetune] Fix dtype (#1038)

1e07b6b

fix jnp dtype

fix F.interpolate() for large batch sizes (#1006)

ab079f2

* fix `upsample_nearest_nhwc` for large bsz * fix `upsample_nearest_nhwc` for large bsz

[Tests] Improve unet / vae tests (#1018)

a80480f

* improve tests * up * finish * upload * add init * up * finish vae * finish * reduce loading time with device_map * remove device_map from CPU * uP

[Tests] Speed up slow tests (#1040)

d2d9764

* [Tests] Speed up slow tests * Up * up

Fix some failing tests (#1041)

8d6487f

* up * up * up * Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion.py * Apply suggestions from code review

[Tests] Better prints (#1043)

c4ef1ef

[Tests] no random latents anymore (#1045)

d37f08d

hot fix

cbbb293

fix

ea01a4c

increase tolerance

a7ae808

higher precision for vae

81b6fbf

Fix speedup ratio in fp16.mdx (#837)

fc0ca47

clean incomplete pages (#1008)

12fd073

Add seed resizing to community pipelines (#1011)

1fc2088

* add seed resizing to community examples * actually add the file responsible for seed resizing

Community example: First proposal of a multilingual stable diffusion …

df757dd

…pipeline

Minor bug in readme file

27b7f22

Fix some grammar errors

64911eb

Add correct link to the example

0edba0e

Add correct link to readme file

c5acc63

Changes to readm file

2ff04b3

juancopi81 closed this Oct 29, 2022

juancopi81 deleted the multilingual_text_to_image_pipeline branch October 29, 2022 16:58

PhaneeshB pushed a commit to nod-ai/diffusers that referenced this pull request Mar 1, 2023

Add conditions to force use --import_mlir (huggingface#1028)

d973ba1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Community Pipeline] : First proposal of a multilingual stable diffusion … #1028

[Community Pipeline] : First proposal of a multilingual stable diffusion … #1028

Uh oh!

juancopi81 commented Oct 27, 2022 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Oct 27, 2022

Uh oh!

juancopi81 commented Oct 29, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants

[Community Pipeline] : First proposal of a multilingual stable diffusion … #1028

[Community Pipeline] : First proposal of a multilingual stable diffusion … #1028

Uh oh!

Conversation

juancopi81 commented Oct 27, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Oct 27, 2022

Uh oh!

juancopi81 commented Oct 29, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants

juancopi81 commented Oct 27, 2022 •

edited

Loading