forked from huggingface/diffusers
Euler A redesign merge #2
Merged
Conversation
…ce#447) (huggingface#472)
* Return encoded texts by DiffusionPipelines
* Updated README to show how to use encoded_text_input
* Reverted examples in README.md
* Reverted all
* Warning for long prompts
* Fix bugs
* Formatted
the link points to an old location of the train_unconditional.py file
* Remove deprecated `torch_device` kwarg. * Remove unused imports.
Signed-off-by: Ryan Russell <[email protected]>
* WIP: flax FlaxDiffusionPipeline & FlaxStableDiffusionPipeline
* todo comment
* Fix imports
* add dummies
* Fix empty init
* make pipeline work
* up
* Allow dtype to be overridden on model load. This may be a temporary solution until huggingface#567 is addressed.
* Convert params to bfloat16 or fp16 after loading. This deals with the weights, not the model.
* Use Flax schedulers (typing, docstring)
* PNDM: replace control flow with jax functions. Otherwise jitting/parallelization don't work properly, as they don't know how to deal with traced objects. I temporarily removed `step_prk`.
* Pass latents shape to scheduler set_timesteps(). PNDMScheduler uses it to reserve space; other schedulers will just ignore it.
* Wrap model imports inside availability checks.
* Optionally return state in from_config. Useful for Flax schedulers.
* Do not convert model weights to dtype.
* Re-enable PRK steps with functional implementation. Values returned still not verified for correctness.
* Remove leftover has_state var.
* make style
* Apply suggestion list -> tuple (Co-authored-by: Suraj Patil <[email protected]>)
* Remove unused comments.
* Use zeros instead of empty.
Co-authored-by: Mishig Davaadorj <[email protected]>
Co-authored-by: Patrick von Platen <[email protected]>
Co-authored-by: Suraj Patil <[email protected]>
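The PNDM change above is the notable one: under `jax.jit`, ordinary Python branching on traced values fails, so control flow has to be expressed with JAX primitives. A minimal illustration of the pattern (hypothetical names, not the diffusers source):

```python
import jax
import jax.numpy as jnp

# Under jit, a Python `if` on a traced value raises; jax.lax.cond keeps the
# branch traceable. Hypothetical example, not the actual scheduler code.
def step_branch(counter, sample):
    return jax.lax.cond(
        counter < 10,
        lambda s: s * 0.5,  # branch taken while counter < 10
        lambda s: s,        # identity branch afterwards
        sample,
    )

print(jax.jit(step_branch)(jnp.asarray(3), jnp.ones(4)))
```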
* Fix SpatialTransformer
Co-authored-by: ydshieh <[email protected]>
* Add training example for DreamBooth.
* Fix bugs.
* Update readme and default hyperparameters.
* Reformatting code with black.
* Update for multi-gpu training.
* Apply suggestions from code review
* improve sampling
* fix autocast
* improve sampling more
* fix saving
* actually fix saving
* improve dataset
* fix collate_fn
* fix key name
* fix dataset
* concat batch in collate_fn
* add grad ckpt
* add option for 8bit adam
* do two forward passes for prior preservation
* Revert "do two forward passes for prior preservation" (reverts commit 661ca46)
* add option for prior_loss_weight
* add option for clip grad norm
* add more comments
* update readme
* Apply suggestions from code review (Co-authored-by: Patrick von Platen <[email protected]>)
* add docstring for dataset
* update the saving logic
* Update examples/dreambooth/README.md
* remove unused imports
Co-authored-by: Suraj Patil <[email protected]>
Co-authored-by: Patrick von Platen <[email protected]>
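The prior-preservation commits sketch the final design: instance and class examples are concatenated in the `collate_fn`, so a single forward pass serves both, and the class-image loss is weighted separately. A rough sketch under those assumptions (`model_pred`, `target`, and the function name are illustrative):

```python
import torch
import torch.nn.functional as F

# Single-forward-pass prior preservation: the batch holds instance examples
# in the first half and class (prior) examples in the second, so the
# prediction can be chunked back apart before computing the two losses.
def dreambooth_loss(model_pred, target, prior_loss_weight=1.0):
    pred, prior_pred = torch.chunk(model_pred, 2, dim=0)
    tgt, prior_tgt = torch.chunk(target, 2, dim=0)
    loss = F.mse_loss(pred.float(), tgt.float(), reduction="mean")
    prior_loss = F.mse_loss(prior_pred.float(), prior_tgt.float(), reduction="mean")
    return loss + prior_loss_weight * prior_loss

pred = torch.randn(4, 4, 64, 64)  # first half instance, second half class
print(dreambooth_loss(pred, torch.zeros_like(pred), prior_loss_weight=1.0))
```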
* pytorch only schedulers
* fix style
* remove match_shape
* pytorch only ddpm
* remove SchedulerMixin
* remove numpy from karras_ve
* fix types
* remove numpy from lms_discrete
* remove numpy from pndm
* fix typo
* remove mixin and numpy from sde_vp and ve
* remove remaining tensor_format
* sigmas has to be torch tensor
* removed set_format in readme
* remove set_format from docs
* remove set_format from pipelines
* update tests
* continue to use mixin
* fix imports
* removed unused imports
* match shape instead of assuming image shapes
* remove import typo
* update call to add_noise
* use math instead of numpy
* fix t_index
* removed commented out numpy tests
* timesteps needs to be discrete
* cast timesteps to int in flax scheduler too
* fix device mismatch issue
* small fix
* Update src/diffusers/schedulers/scheduling_pndm.py
Co-authored-by: Patrick von Platen <[email protected]>
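For reference, a torch-only `add_noise` along the lines these commits describe, with timesteps cast to discrete integer indices; this is a sketch of the standard forward diffusion, not the library code:

```python
import torch

# Forward diffusion q(x_t | x_0) with torch tensors only.
# `alphas_cumprod` is the cumulative product of (1 - beta_t).
def add_noise(original_samples, noise, timesteps, alphas_cumprod):
    timesteps = timesteps.to(original_samples.device).long()  # discrete indices
    sqrt_alpha_prod = alphas_cumprod[timesteps] ** 0.5
    sqrt_one_minus_alpha_prod = (1 - alphas_cumprod[timesteps]) ** 0.5
    # unsqueeze to broadcast over (batch, channels, height, width)
    while sqrt_alpha_prod.dim() < original_samples.dim():
        sqrt_alpha_prod = sqrt_alpha_prod.unsqueeze(-1)
        sqrt_one_minus_alpha_prod = sqrt_one_minus_alpha_prod.unsqueeze(-1)
    return sqrt_alpha_prod * original_samples + sqrt_one_minus_alpha_prod * noise
```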
…face#649) don't pass tensor_format
update install section
fix add noise
* add deprecation warning for schedulers * fix format
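A hypothetical sketch of such a deprecation shim, assuming the old scheduler entry point becomes a warning-only no-op:

```python
import warnings

# Hypothetical shim: keep the old method alive but warn that it does nothing
# now that schedulers work with torch tensors directly.
def set_format(self, tensor_format="pt"):
    warnings.warn(
        "`set_format` is deprecated and has no effect; schedulers now work "
        "with torch tensors directly.",
        DeprecationWarning,
    )
    return self
```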
…ace#653) remove set_format from pipeline
fix np onnx
…)
* Replace deprecation warning f-string with class name. When `__repr__` is invoked, serialization of the instance's `config_dict` fails because it contains `kwargs` of type `<class inspect._empty>`.
* Revert "Replace deprecation warning f-string with class name." (reverts commit 1c4eb8c)
* Do not attempt to register `"kwargs"` as an attribute. Otherwise serialization could fail. This may happen for other attributes, so we should create a better solution.
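A hypothetical sketch of the guard described in the last commit; the real `register_to_config` in diffusers is more involved:

```python
import inspect

# Hypothetical guard: never register the catch-all "kwargs" (or any
# inspect._empty value) as a config attribute, since it cannot be serialized.
def register_to_config(self, **config):
    config.pop("kwargs", None)
    config = {k: v for k, v in config.items() if v is not inspect.Parameter.empty}
    for name, value in config.items():
        setattr(self, name, value)
    self._internal_dict = dict(config)
```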
* Fix the LMS pytorch regression * Copy over the changes from huggingface#637 * Fix betas test
…ggingface#645) * Added script to save during training * Suggested changes
…face#667) take the correct text embeddings
update transformers version in example
* lower tolerance * put model in eval mode
Flax from_pretrained: clean up `mismatched_keys`. Originally removed in 73e0bc6.
* correcting the beta value assignment * updating DDIM and LMSDiscreteFlax schedulers * bringing back the changes that were lost as part of the main branch merge
renamed x to hidden_states
* initial commit
* make UNet stream capturable
* try to fix noise_pred value
* remove cuda graph and keep NB
* non blocking unet with PNDMScheduler
* make timesteps np arrays for pndm scheduler, because lists don't get formatted to tensors in `self.set_format`
* make max async in pndm
* use channels last format in unet
* avoid moving timesteps device in each unet call
* avoid memcpy op in `get_timestep_embedding`
* add `channels_last` kwarg to `DiffusionPipeline.from_pretrained`
* update TODO
* replace `channels_last` kwarg with `memory_format` for more generality
* revert the channels_last changes to leave them for another PR
* remove non_blocking when moving input ids to device
* remove blocking from all .to() operations at beginning of pipeline
* fix merging
* model can run in other precisions without autocast
* attn refactoring
* Revert "attn refactoring" (reverts commit 0c70c0e)
* remove restriction to run conv_norm in fp32
* use `baddbmm` instead of `matmul` in attention for better perf
* removing all reshapes to test perf
* Revert "removing all reshapes to test perf" (reverts commit 006ccb8)
* add shapes comments
* hardcode what's needed for jitting
* Revert "hardcode what's needed for jitting" (reverts commit 2fa9c69)
* Revert "remove restriction to run conv_norm in fp32" (reverts commit cec5928)
* revert using baddbmm in attention's forward
* cleanup comment
* remove restriction to run conv_norm in fp32; no quality loss was noticed (reverts commit cc9bc13)
* add more optimization techniques to docs
* Revert "add shapes comments" (reverts commit 31c58ea)
* apply suggestions
* make quality
* styling
* `scheduler.timesteps` are now arrays, so we don't need .to()
* remove useless .type()
* use mean instead of max in `test_stable_diffusion_inpaint_pipeline_k_lms`
* move scheduler timesteps to correct device if tensors
* add device to `set_timesteps` in LMSD scheduler
* `self.scheduler.set_timesteps` now uses device arg for schedulers that accept it
* quick fix
* remove kwargs from schedulers' `set_timesteps`
* revert to using max in K-LMS inpaint pipeline test
* Revert "`self.scheduler.set_timesteps` now uses device arg for schedulers that accept it" (reverts commit 00d5a51)
* move timesteps to correct device before loop in SD pipeline
* apply previous fix to other SD pipelines
* UNet now accepts tensor timesteps even on the wrong device, to avoid errors: it shouldn't affect performance if timesteps are already on the correct device, but it does slow down performance if they're on the wrong device
* fix pipeline when timesteps are arrays with strides
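The `baddbmm` commit refers to computing attention scores with a single fused batched op, folding the 1/sqrt(d) scaling into the multiply. A small self-contained sketch of the trick:

```python
import torch

# Fuse the scale into a batched matmul with baddbmm; beta=0 means the
# (uninitialized) input tensor is ignored, so torch.empty is safe here.
b, n, d = 2, 64, 40
q = torch.randn(b, n, d)
k = torch.randn(b, n, d)
scale = d ** -0.5

scores = torch.baddbmm(
    torch.empty(b, n, n, dtype=q.dtype, device=q.device),
    q,
    k.transpose(-1, -2),
    beta=0,       # ignore the input tensor entirely
    alpha=scale,  # fold the 1/sqrt(d) scaling into the matmul
)
assert torch.allclose(scores, torch.bmm(q, k.transpose(-1, -2)) * scale, atol=1e-5)
```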
* Allow resolutions that are not multiples of 64
* ran black
* fix bug
* add test
* more explanation
* more comments
Co-authored-by: Patrick von Platen <[email protected]>
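The gist of the resolution fix, as I read it: when the input size is not a multiple of the overall downsampling factor, the upsamplers need an explicit target size so decoder features line up with their skip connections again. Illustrative only, not the actual UNet code:

```python
import torch
import torch.nn.functional as F

# With an odd-sized input, doubling via scale_factor overshoots the skip
# connection's size; passing the exact target size fixes the mismatch.
def upsample(hidden_states, output_size=None):
    if output_size is None:
        return F.interpolate(hidden_states, scale_factor=2.0, mode="nearest")
    return F.interpolate(hidden_states, size=output_size, mode="nearest")

x = torch.randn(1, 4, 8, 8)  # came from downsampling a 15x15 feature map
print(upsample(x, output_size=(15, 15)).shape)  # matches the skip connection
```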
…uggingface#680) refactor: update ldm-bert `config.json` url Signed-off-by: Ryan Russell <[email protected]>
* Fix push_to_hub for dreambooth and textual_inversion * Use repo.push_to_hub instead of push_to_hub
The opset argument should be an `int` but was set as a `str`.
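Presumably the fix amounts to declaring the argument with `type=int`; the default value below is an assumption:

```python
import argparse

# Parse --opset as an int so the ONNX exporter receives a number, not a
# string. The default of 14 is a placeholder, not the script's actual value.
parser = argparse.ArgumentParser()
parser.add_argument("--opset", default=14, type=int)
args = parser.parse_args(["--opset", "14"])
assert isinstance(args.opset, int)
```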
…huggingface#731) This is to ensure that the final latent slices stay somewhat consistent as more changes are introduced into the library. Signed-off-by: James R T <[email protected]>
Otherwise, it crashes when eta > 0 with float16.
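Presumed shape of the fix: when eta > 0, DDIM adds `sigma_t * noise`, and that noise must be created in the sample's own dtype and device rather than defaulting to float32. Placeholder values throughout:

```python
import torch

# Create the variance noise in the model output's dtype and device, so a
# half-precision pipeline stays half-precision when eta > 0.
model_output = torch.randn(1, 4, 64, 64, dtype=torch.float16)
sigma_t = 0.1
noise = torch.randn(
    model_output.shape, dtype=model_output.dtype, device=model_output.device
)
prev_sample = model_output + sigma_t * noise  # stays float16 throughout
```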
* handle dtype in vae and image2image pipeline * handle dtype in add noise * don't modify vae and pipeline * remove the if
* handle dtype in vae and image2image pipeline
* fix inpaint in fp16
* dtype should be handled in add_noise
* style
* address review comments
* add simple fast tests to check fp16
* fix test name
* put mask in fp16
* Fix tests * remove bogus file
* Fix an exception: if dst_path is not a file, `src_path.samefile` raises FileNotFoundError: [Errno 2] No such file or directory: '/home/lilongwei/notebook/onnx_diffusion/vae_decoder/model.onnx' * Update src/diffusers/onnx_utils.py Co-authored-by: Anton Lozhkov <[email protected]>
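The gist of the fix, sketched with placeholder paths: `Path.samefile` raises `FileNotFoundError` if either path is missing, so existence has to be checked first.

```python
from pathlib import Path

src_path = Path("vae_decoder/model.onnx")          # placeholder paths
dst_path = Path("output/vae_decoder/model.onnx")

# Path.samefile raises FileNotFoundError when either path does not exist,
# so guard the comparison with existence checks.
already_there = (
    src_path.exists() and dst_path.exists() and src_path.samefile(dst_path)
)
```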
* clean up resnet.py * make style and quality * minor formatting
* add sigmoid betas * convert to torch * add comment on source
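The sigmoid beta schedule is simple enough to state inline; this torch-only sketch mirrors the standard formulation (squash a linear ramp through a sigmoid, then rescale into `[beta_start, beta_end]`):

```python
import torch

# Sigmoid beta schedule: a linear ramp through a sigmoid, rescaled to the
# [beta_start, beta_end] range. Hyperparameter values are the common defaults.
num_train_timesteps, beta_start, beta_end = 1000, 1e-4, 2e-2
betas = torch.linspace(-6, 6, num_train_timesteps)
betas = torch.sigmoid(betas) * (beta_end - beta_start) + beta_start
```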
* add accelerate to load models with smaller memory footprint
* remove low_cpu_mem_usage as it is redundant
* move accelerate init weights context to modeling utils
* add test to ensure results are the same when loading with accelerate
* add tests to ensure ram usage gets lower when using accelerate
* move accelerate logic to single snippet under modeling utils and remove it from configuration utils
* format code to pass quality check
* fix imports with isort
* add accelerate to test extra deps
* only import accelerate if device_map is set to auto
* move accelerate availability check to diffusers import utils
* format code
* add device map to pipeline abstraction
* lint it to pass PR quality check
* fix class check to use accelerate when using diffusers ModelMixin subclasses
* use low_cpu_mem_usage in transformers if device_map is not available
* NoModuleLayer
* comment out tests
* up
* finish
* Update src/diffusers/pipelines/stable_diffusion/safety_checker.py
* make style
Co-authored-by: Pi Esposito <[email protected]>
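The core low-memory idea, sketched with accelerate's public API (the module and checkpoint path are placeholders): instantiate on the meta device so no RAM is allocated for weights, then load checkpoint tensors straight to their final devices.

```python
import torch.nn as nn
from accelerate import init_empty_weights, load_checkpoint_and_dispatch

# Build the module on the meta device: no memory is allocated for weights.
with init_empty_weights():
    model = nn.Sequential(nn.Linear(1024, 1024), nn.Linear(1024, 1024))

# Stream checkpoint tensors directly to their target devices.
model = load_checkpoint_and_dispatch(model, "path/to/checkpoint", device_map="auto")
```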
* Fix gradient checkpointing test * more tests
fix typo in docstring
…e#735)
* Support deepspeed
* Dreambooth DeepSpeed documentation
* Remove unnecessary casts, documentation. Due to recent commits, some casts to half precision are not necessary anymore. Mention that DeepSpeed's version of Adam is about 2x faster.
* Review comments
* support bf16 for stable diffusion * fix typo * address review comments
* begin text2image script
* loading the datasets, preprocessing & transforms
* handle input features correctly
* add gradient checkpointing support
* fix output names
* run unet in train mode, not text encoder
* use no_grad instead of freezing params
* default max steps None
* pad to longest
* don't pad when tokenizing
* fix encode on multi gpu
* fix stupid bug
* add random flip
* add ema
* fix ema
* put ema on cpu
* improve EMA model
* contiguous_format
* don't wrap vae and text encoder in accelerate
* remove no_grad
* use randn_like
* fix resize
* improve few things
* log epoch loss
* set log level
* don't log each step
* remove max_length from collate
* style
* add report_to option
* make scale_lr false by default
* add grad clipping
* add an option to use 8bit adam
* fix logging in multi-gpu, log every step
* more comments
* remove eval for now
* address review comments
* add requirements file
* begin readme
* fix typo
* fix push to hub
* populate readme
* update readme
* remove use_auth_token from the script
* address some review comments
* better mixed precision support
* remove redundant to
* create ema model early
* Apply suggestions from code review (Co-authored-by: Pedro Cuenca <[email protected]>)
* better description for train_data_dir
* add diffusers in requirements
* update dataset_name_mapping
* add inference example
Co-authored-by: anton-l <[email protected]>
Co-authored-by: Pedro Cuenca <[email protected]>
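The EMA commits maintain a slow-moving average of the UNet weights for use at inference time. A minimal update step of that kind (names are illustrative):

```python
import torch

# Exponential moving average of model parameters; decay close to 1 keeps a
# slow-moving copy of the weights that is typically used for inference.
@torch.no_grad()
def ema_step(ema_params, model_params, decay=0.9999):
    for ema_p, p in zip(ema_params, model_params):
        ema_p.mul_(decay).add_(p, alpha=1 - decay)
```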
* pass norm_num_groups param and add tests * set resnet_groups for FlaxUNetMidBlock2D * fixed docstrings * fixed typo * using is_flax_available util and created require_flax decorator
Update custom_pipelines.mdx
…e#766)
* mps: alt. implementation for repeat_interleave
* style
* Bump mps version of PyTorch in the documentation.
* Apply suggestions from code review (Co-authored-by: Suraj Patil <[email protected]>)
* Simplify: do not check for device.
* Fix repeat dimensions:
  - The unconditional embeddings are always created from a single prompt.
  - I was shadowing the batch_size var.
* Split long lines as suggested by Suraj.
Co-authored-by: Patrick von Platen <[email protected]>
Co-authored-by: Suraj Patil <[email protected]>
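The usual workaround for `repeat_interleave` is an expand-and-reshape; a self-contained sketch that checks itself against the reference op:

```python
import torch

# Alternative to repeat_interleave along dim 0 (which was problematic on mps
# at the time): insert a new axis, broadcast across it, then flatten it back.
def repeat_dim0(x, n):
    b = x.shape[0]
    x = x[:, None].expand(b, n, *x.shape[1:])
    return x.reshape(b * n, *x.shape[2:])

assert torch.equal(
    repeat_dim0(torch.arange(6).view(2, 3), 2),
    torch.arange(6).view(2, 3).repeat_interleave(2, dim=0),
)
```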
Getting the Euler A scheduler more up to date with the main huggingface diffusers branch.