[Docs] Adds a documentation page for evaluating diffusion models #2516

sayakpaul · 2023-02-28T15:46:51Z

Should be useful to the community.

HuggingFaceDocBuilderDev · 2023-02-28T15:51:02Z

The documentation is not available anymore as the PR was closed or merged.

docs/source/en/conceptual/evaluation.mdx

patrickvonplaten

Very cool! Left some suggestions

Co-authored-by: Patrick von Platen <[email protected]> Co-authored-by: Kashif Rasul <[email protected]>

pcuenca

Very cool!

docs/source/en/conceptual/evaluation.mdx

pcuenca · 2023-03-01T09:46:30Z

docs/source/en/conceptual/evaluation.mdx

+
+![edit-instruction](https://huggingface.co/datasets/diffusers/docs-images/resolve/main/evaluation_diffusion_models/edit-instruction.png)
+
+One strategy to evaluate such a model is to measure the consistency of the change between the two images (in [CLIP](https://huggingface.co/docs/transformers/model_doc/clip) space) with the change between the two image captions (as shown in [CLIP-Guided Domain Adaptation of Image Generators](https://arxiv.org/abs/2108.00946)). This is referred to as the "**CLIP directional similarity**".
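
The metric described in the hunk above can be sketched on plain embedding vectors. This is a minimal illustration, not the code from the documentation page: the function names are hypothetical, and the embeddings are assumed to have already been produced by CLIP's image and text encoders (for example via `CLIPModel.get_image_features` and `CLIPModel.get_text_features` in `transformers`). Each embedding is normalized, and the metric is the cosine similarity between the image-space change and the caption-space change.

```python
import math
from typing import List, Sequence


def _normalize(vec: Sequence[float]) -> List[float]:
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def clip_directional_similarity(
    img_original: Sequence[float],
    img_edited: Sequence[float],
    txt_original: Sequence[float],
    txt_edited: Sequence[float],
) -> float:
    """Cosine similarity between the image edit direction and the caption edit direction.

    All four inputs are assumed to be embeddings from the same CLIP model.
    """
    # Normalize every embedding so that only direction matters.
    img_o, img_e = _normalize(img_original), _normalize(img_edited)
    txt_o, txt_e = _normalize(txt_original), _normalize(txt_edited)

    # Direction of change in image space and in caption space.
    img_dir = [e - o for e, o in zip(img_e, img_o)]
    txt_dir = [e - o for e, o in zip(txt_e, txt_o)]

    # Cosine similarity of the two directions (raises if either direction is zero).
    img_dir, txt_dir = _normalize(img_dir), _normalize(txt_dir)
    return sum(a * b for a, b in zip(img_dir, txt_dir))
```

A value near 1 means the image changed in the same direction as the caption; a value near -1 means it changed in the opposite direction.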


Nice! Maybe we can draw parallels with "guidance scale" here?

Like the following?

> One could consider this measure to be orthogonal to the use of `guidance_scale` in the `DiffusionPipeline`. The higher the `guidance_scale`, the more constrained the generation becomes on the input text prompt.

I meant that the method looks similar to how guidance scale is used to push the model in a direction that improves caption-image similarity. But having read the full post again, I think adding this would be more confusing than clarifying, so I'd just leave it as it is now :)
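
For readers following the guidance-scale analogy, here is a rough sketch of classifier-free guidance, the mechanism that `guidance_scale` controls in Stable Diffusion-style pipelines. The function name and the use of plain lists are assumptions for illustration; real pipelines apply the same arithmetic to noise-prediction tensors.

```python
from typing import List, Sequence


def apply_classifier_free_guidance(
    noise_uncond: Sequence[float],
    noise_text: Sequence[float],
    guidance_scale: float,
) -> List[float]:
    """Combine unconditional and text-conditioned noise predictions.

    The unconditional prediction is pushed along the direction that the text
    prompt adds; a larger guidance_scale means a larger push.
    """
    return [
        u + guidance_scale * (t - u)
        for u, t in zip(noise_uncond, noise_text)
    ]
```

With `guidance_scale=1.0` the result equals the text-conditioned prediction, and larger values extrapolate further along the prompt direction, which is why generations become more tightly tied to the prompt.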

docs/source/en/conceptual/evaluation.mdx

pcuenca · 2023-03-01T10:01:51Z

Maybe also mention that our training scripts have built-in TensorBoard and W&B logging, so we recommend checking both qualitative and quantitative measures while training.

Co-authored-by: Pedro <[email protected]>

Co-authored-by: Pedro Cuenca <[email protected]>

Co-authored-by: Pedro <[email protected]>

sayakpaul · 2023-03-01T11:07:47Z

@pcuenca thanks so much for your comments! I addressed all of them except for #2516 (comment). Let me know your thoughts on the latest changes.

docs/source/en/conceptual/evaluation.mdx

yiyixuxu

Awesome! Thank you so much for adding this! I learnt a lot from it.

docs/source/en/conceptual/evaluation.mdx

Co-authored-by: Will Berman <[email protected]> Co-authored-by: YiYi Xu <[email protected]>

sayakpaul · 2023-03-13T09:40:43Z

Thanks for all the reviews. I have addressed all of them. I figured that readers of the doc might want to work hands-on with the content presented in it, so I worked on huggingface/notebooks#336 as well.

@williamberman @pcuenca could you do one final pass and comment?

williamberman

nice!

pcuenca

Very cool! I like it a lot, I think it's a great introduction to evaluation methods.

docs/source/en/conceptual/evaluation.mdx

pcuenca · 2023-03-15T09:48:16Z

docs/source/en/conceptual/evaluation.mdx

Co-authored-by: Pedro Cuenca <[email protected]>

huggingface/diffusers#2516

…#336)

* Add files via upload
* update notebook as per PR comments huggingface/diffusers#2516

…gingface#2516)

* add a documentation page for evaluating diffusion models.
* fix: checkpoint link.
* Apply suggestions from code review
  Co-authored-by: Patrick von Platen <[email protected]>
  Co-authored-by: Kashif Rasul <[email protected]>
* formatting fixes.
* formatting fixes.
* link to partiprompts dataset on hub.
* reflect on Pedro's comments.
  Co-authored-by: Pedro <[email protected]>
* Apply suggestions from code review
  Co-authored-by: Pedro Cuenca <[email protected]>
* reflect on Pedro's comments.
  Co-authored-by: Pedro <[email protected]>
* update mention of FID.
* Apply suggestions from code review
  Co-authored-by: Will Berman <[email protected]>
  Co-authored-by: YiYi Xu <[email protected]>
* minor nit.
* finish edges and add colab notebook.
* Apply suggestions from code review
  Co-authored-by: Pedro Cuenca <[email protected]>
* run formatting.
* additional feedback.

---------

Co-authored-by: Patrick von Platen <[email protected]>
Co-authored-by: Kashif Rasul <[email protected]>
Co-authored-by: Pedro <[email protected]>
Co-authored-by: Will Berman <[email protected]>
Co-authored-by: YiYi Xu <[email protected]>

add a documentation page for evaluating diffusion models.

f8619d9

sayakpaul requested review from kashif, patrickvonplaten, pcuenca, williamberman and yiyixuxu February 28, 2023 15:46

sayakpaul marked this pull request as ready for review February 28, 2023 15:47

osanseviero reviewed Mar 1, 2023

docs/source/en/conceptual/evaluation.mdx (outdated)

fix: checkpoint link.

9e0f2fb

kashif reviewed Mar 1, 2023

docs/source/en/conceptual/evaluation.mdx

patrickvonplaten reviewed Mar 1, 2023
