
Conversation

@shirayu
Contributor

@shirayu shirayu commented Sep 12, 2022

@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented Sep 12, 2022

The documentation is not available anymore as the PR was closed or merged.

@shirayu shirayu changed the title [WIP] Return encoded texts by DiffusionPipelines (Fix #447) [WIP] Return encoded texts by DiffusionPipelines (Resolve #447) Sep 12, 2022
@shirayu shirayu marked this pull request as ready for review September 12, 2022 11:22
@shirayu shirayu changed the title [WIP] Return encoded texts by DiffusionPipelines (Resolve #447) Return encoded texts by DiffusionPipelines (Resolve #447) Sep 12, 2022
@patrickvonplaten
Contributor

patrickvonplaten commented Sep 16, 2022

Hey @shirayu,

Thanks for opening the PR! I think we can make our lives a bit easier here by simply catching whether the input was truncated or not beforehand - then we don't have to return the text embeddings and check after the fact :-). How about we replace these lines here:

with:

text_inputs = self.tokenizer(
    prompt,
    padding="max_length",
    max_length=self.tokenizer.model_max_length,
    return_tensors="pt",
)
text_input_ids = text_inputs.input_ids

if text_input_ids.shape[-1] > self.tokenizer.model_max_length:
    removed_text = self.tokenizer.batch_decode(text_input_ids[:, self.tokenizer.model_max_length:])
    logger.warning(f"The following part of your input was truncated because CLIP can only handle sequences up to 77 tokens: {removed_text}")

text_input_ids = text_input_ids[:, :self.tokenizer.model_max_length].to(self.device)

Then the user gets a nice warning and we are error-robust :-)

@shirayu
Contributor Author

shirayu commented Sep 17, 2022

Thank you for the comment @patrickvonplaten !

I think using logger.warn is useful for CLI users, but not when the pipeline is called from other applications.
For example, if a web app accepts a prompt and returns the result from these pipelines, the warning never reaches the web app's user.

@patrickvonplaten
Contributor

Hey @shirayu,

I think you can catch warnings with Python: https://stackoverflow.com/questions/5644836/in-python-how-does-one-catch-warnings-as-if-they-were-exceptions

I'm not sure we want to return text embeddings just for a potential warning that could be displayed later - the use case is too small to warrant adding a new output tuple which might break pipelines that expect only two outputs.

Would it be ok for you to try out catching the warning or adding a specific logger? https://stackoverflow.com/questions/14058453/making-python-loggers-output-all-messages-to-stdout-in-addition-to-log-file
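On the "specific logger" idea - a minimal sketch, assuming the truncation message is emitted through the diffusers logging hierarchy (i.e. a logger whose name starts with "diffusers"), of how an application could capture it and show it to its own users instead of relying on stderr:

import logging

captured_warnings = []  # illustrative container owned by the application

class CaptureHandler(logging.Handler):
    def emit(self, record):
        # Keep the formatted message so the web app can display it later.
        captured_warnings.append(self.format(record))

# Assumption: pipeline loggers propagate to the top-level "diffusers" logger.
logging.getLogger("diffusers").addHandler(CaptureHandler())

# ... run the pipeline here; any truncation warning ends up in captured_warnings.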

@patrickvonplaten
Contributor

Also @patil-suraj @pcuenca @anton-l what do you think here?


import PIL
from PIL import Image
from transformers import BatchEncoding
Contributor


Could you please move this under line 34 (below is_transformers_available())? Otherwise this breaks the init, as transformers is not a hard dependency.
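For reference, a rough sketch of the guarded-import pattern being asked for here (is_transformers_available is the existing helper in diffusers.utils; the placement and surrounding code are only indicative):

from .utils import is_transformers_available

# Only import transformers symbols when the optional dependency is installed.
if is_transformers_available():
    from transformers import BatchEncoding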

Contributor

@patrickvonplaten patrickvonplaten left a comment


@anton-l @patil-suraj would love to hear your opinion here.

I would have preferred to just throw a warning given the goal of this PR, but I also see why it could make sense to return text_embeddings - what do you think?

@shirayu - either way, we could only return the text embeddings optionally (add a return_embeddings=True/False flag to __call__) so as not to break the existing outputs.

Overall, I'm leaning towards not adding this functionality, though, as it would add another argument to the __call__ API and another output to Stable Diffusion for what is IMO quite an edge case.

@patrickvonplaten
Contributor

Keen to hear your thoughts here @shirayu

@patil-suraj
Contributor

patil-suraj commented Sep 22, 2022

Thanks for the PR @shirayu !

Returning text_embeddings only to check whether the prompt was truncated doesn't seem ideal to me. I'm also in favor of throwing a warning instead. You can run the tokenizer before or after the pipeline to check if the prompt is too long. Also, running the tokenizer is not an expensive operation, so I think it won't hurt anything.
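As an illustration of that suggestion (not part of this PR; the checkpoint name below is the CLIP tokenizer used by Stable Diffusion's text encoder and the prompt is a stand-in), a caller could check the prompt before invoking the pipeline:

from transformers import CLIPTokenizer

tokenizer = CLIPTokenizer.from_pretrained("openai/clip-vit-large-patch14")
prompt = "an extremely detailed prompt " * 30  # stand-in for user input

# Tokenize without truncation so the full length is visible.
input_ids = tokenizer(prompt, return_tensors="pt").input_ids
if input_ids.shape[-1] > tokenizer.model_max_length:  # 77 for CLIP
    removed = tokenizer.batch_decode(input_ids[:, tokenizer.model_max_length:])
    # Surface this to the user however the application prefers.
    print(f"Prompt is too long; this part would be ignored: {removed}")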

@shirayu shirayu changed the title Return encoded texts by DiffusionPipelines (Resolve #447) Warning for too long prompts in DiffusionPipelines (Resolve #447) Sep 23, 2022
@shirayu
Contributor Author

shirayu commented Sep 23, 2022

Thank you for your comments.
After reading them, I now think the pipeline doesn't necessarily need to return the tokens.
Following @patrickvonplaten's comment, I've changed it to log a warning instead.

Note that this will drop the last special token <|endoftext|> from the truncated tokens.
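To make that note concrete, a small illustration with the standalone CLIP tokenizer (the checkpoint name and prompt are stand-ins, not the pipeline code itself): the overflowing ids end with the <|endoftext|> special token, which the warning leaves out:

from transformers import CLIPTokenizer

tokenizer = CLIPTokenizer.from_pretrained("openai/clip-vit-large-patch14")
ids = tokenizer("token " * 100, return_tensors="pt").input_ids  # well over the 77-token limit

overflow = ids[:, tokenizer.model_max_length:]
print(tokenizer.batch_decode(overflow))          # decoded text ends with "<|endoftext|>"
print(tokenizer.batch_decode(overflow[:, :-1]))  # the trailing special token dropped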

@patrickvonplaten patrickvonplaten merged commit f7ebe56 into huggingface:main Sep 27, 2022
@shirayu shirayu deleted the feature/stable_diffusion/return_encoded_text branch October 23, 2022 12:50
prathikr pushed a commit to prathikr/diffusers that referenced this pull request Oct 26, 2022
…ce#447) (huggingface#472)

* Return encoded texts by DiffusionPipelines

* Updated README to show how to use encoded_text_input

* Reverted examples in README.md

* Reverted all

* Warning for long prompts

* Fix bugs

* Formatted
yoonseokjin pushed a commit to yoonseokjin/diffusers that referenced this pull request Dec 25, 2023
…ce#447) (huggingface#472)
Development

Successfully merging this pull request may close these issues.

Prompt truncation detection and return of tokenized prompts