allow multiple generations per prompt #741
Conversation
pcuenca left a comment
Looks good!
We should also do it in the flax version for faster inference, but we can do so later.
NouamaneTazi left a comment
Amazing PR, thank you for taking care of this 🙏🏼
uncond_embeddings = self.text_encoder(uncond_input.input_ids.to(self.device))[0]

# duplicate unconditional embeddings for each generation per prompt
uncond_embeddings = uncond_embeddings.repeat_interleave(batch_size * num_images_per_prompt, dim=0)
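For illustration, a minimal sketch of what this repeat does, with made-up tensor shapes (the sizes below are assumptions chosen for readability, not the pipeline's real dimensions):

```python
import torch

# pretend two prompts were encoded into embeddings of shape (2, 77, 768)
uncond_embeddings = torch.randn(2, 77, 768)
num_images_per_prompt = 3

# each prompt's embedding is repeated along dim 0, giving row order
# [p0, p0, p0, p1, p1, p1], so every generated image gets its own copy
repeated = uncond_embeddings.repeat_interleave(num_images_per_prompt, dim=0)
print(repeated.shape)  # torch.Size([6, 77, 768])
```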
Last time I checked I had issues with repeat_interleave and ONNX. We probably just need to require a specific opset version though.
Can you please double-check @anton-l?
Thanks, will check that. But this PR doesn't modify the ONNX pipeline, so it should be fine.
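For reference, a rough sketch of the kind of export check being discussed; the toy module, file name, and opset value below are assumptions for illustration, not part of this PR:

```python
import torch

class Repeater(torch.nn.Module):
    def forward(self, x):
        # the same op used above to duplicate embeddings per prompt
        return x.repeat_interleave(2, dim=0)

# repeat_interleave is not exportable with very old opsets; requesting a
# newer opset (11 here, as an assumption) is the usual workaround
torch.onnx.export(
    Repeater(),
    torch.randn(1, 4, 8),
    "repeater.onnx",
    opset_version=11,
    input_names=["x"],
    output_names=["y"],
)
```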
patrickvonplaten left a comment
Nice!
In the current implementation, if you want 9 images, the batch size is multiplied by 9. Good luck fitting that on a consumer GPU 😁 When I saw this arg I was expecting it to run the pipeline sequentially (with the same prompt); a lot of other repos do it that way. Extra kudos: when num_images_per_prompt > 1, also accept a list of seeds.
We indeed need a good warning / error-throwing system here.
Hmm, I'm not sure this has anything to do with warnings or errors. Just pointing out that, from a user's perspective, batch_size and num_images_per_prompt are the same.
Yeah but
Thanks for the comment @rvorias! It would be a bit complicated to run the pipe sequentially inside the pipe; users can do it very easily by just calling the pipeline in a loop, and that way they have full control over it. We'll document this better in the examples and docs :)
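For users looking for the sequential behaviour (and per-image seeds) requested above, here is a minimal sketch of the loop approach, assuming a recent diffusers API; the model id and seed values are just examples:

```python
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained("CompVis/stable-diffusion-v1-4").to("cuda")

prompt = "a photo of an astronaut riding a horse"
seeds = [0, 1, 2]  # one seed per image; values are arbitrary

images = []
for seed in seeds:
    generator = torch.Generator(device="cuda").manual_seed(seed)
    # each call runs the pipeline once at batch size 1, so memory stays flat
    images.append(pipe(prompt, generator=generator).images[0])
```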
* compute text embeds per prompt
* don't repeat uncond prompts
* repeat separately
* update image2image
* fix repeat uncond embeds
* adapt inpaint pipeline
* fix uncond tokens in img2img
* add tests and fix uncond embeds in img2img and inpaint pipe
To generate multiple images for a prompt, we currently need to repeat the prompt before calling the pipeline. Because of this, the text embeddings and uncond embeddings are computed multiple times for the same prompt.

This PR adds a num_images_per_prompt argument to the stable diffusion pipelines to allow returning multiple images per prompt without repeating it. With this, the text embeddings and uncond embeddings for each prompt are computed once and repeated according to the value of num_images_per_prompt.

Thanks @NouamaneTazi !
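A minimal usage sketch of the new argument (the model id and prompt are just examples, and the output attribute assumes a recent diffusers version):

```python
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained("CompVis/stable-diffusion-v1-4").to("cuda")

# the prompt is encoded once; its embeddings are repeated internally and a
# single batched forward pass returns num_images_per_prompt images
output = pipe("a photo of an astronaut riding a horse", num_images_per_prompt=4)
print(len(output.images))  # 4
```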