forked from huggingface/diffusers
Add SEGA for DiTs (SD3, AuraFlow, Hunyuan, FLUX) #2
Merged: manuelbrack merged 200 commits into ml-research:semantic_sd3 from Marlon154:semantic_dits on Aug 19, 2024
Conversation
Co-authored-by: Sayak Paul <[email protected]>
…s. (huggingface#8692) create a utility for calculating the expected number of shards.
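The shard-count utility mentioned above can be sketched roughly as follows. This is a hypothetical stand-in, not the PR's actual helper: the function name `expected_num_shards` and its signature are assumptions, but the arithmetic (a checkpoint of `total_size` bytes split into shards of at most `max_shard_size` bytes needs the ceiling of their ratio) is the standard calculation.

```python
import math

def expected_num_shards(total_size_bytes: int, max_shard_size_bytes: int) -> int:
    # A checkpoint smaller than one shard still produces a single file;
    # otherwise each shard holds at most max_shard_size_bytes, so we need
    # ceil(total / max_shard) shards.
    if total_size_bytes <= max_shard_size_bytes:
        return 1
    return math.ceil(total_size_bytes / max_shard_size_bytes)
```

For example, a 10 GiB checkpoint with a 5 GiB shard limit yields 2 shards, while 11 GiB yields 3.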
…huggingface#8630)
* add CLIP text-encoder training
* no DoRA
* text encoder training fixes
* add text_encoder layers to save_lora
* style and import fixes
* review changes
* add LoRA tag
* add README notes
* add tests for CLIP encoders
* typo and style fixes
* Update tests/lora/test_lora_layers_sd3.py
* Update examples/dreambooth/README_sd3.md
* minor README change
Co-authored-by: YiYi Xu <[email protected]>
Co-authored-by: Sayak Paul <[email protected]>
* first draft --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Junhwa Song <[email protected]> Co-authored-by: Ahn Donghoon (안동훈 / suno) <[email protected]> Co-authored-by: Sayak Paul <[email protected]> Co-authored-by: Steven Liu <[email protected]>
* doc for max_sequence_length * better position and changed note to tip * apply suggestions --------- Co-authored-by: Sayak Paul <[email protected]>
add conversion script
* add docs on model sharding
* add entry to _toctree
* Apply suggestions from code review
* simplify wording
* add a note on transformer library handling
* move device placement section
* Update docs/source/en/training/distributed_inference.md
Co-authored-by: Steven Liu <[email protected]>
* update * update --------- Co-authored-by: Sayak Paul <[email protected]>
* add more about from_pipe API * Update docs/source/en/using-diffusers/pag.md * Update docs/source/en/using-diffusers/pag.md --------- Co-authored-by: yiyixuxu <yixu310@gmail,com>
…ace#8694) * add controlnet support --------- Co-authored-by: xingchaoliu <[email protected]> Co-authored-by: yiyixuxu <yixu310@gmail,com>
…lines. (huggingface#8676) * add reporting mechanism when mirroring community pipelines. * remove unneeded argument * get the actual PATH_IN_REPO * don't need tag
…#8699)
* fix: unet save_attn_procs at custom diffusion
* style: recover unchanged parts (max line length 119) / mod: add condition
* style: recover unchanged parts (max line length 119)
Co-authored-by: Sayak Paul <[email protected]>
…ass. (huggingface#8698) * remove deprecation from transformer2d regarding the output class. * up * deprecate more
fix vanilla fine-tuned lora loading.
…ingface#8688) fix conversion utility so that lora dora loads correctly
* modify PR and issue templates * add single file poc.
…face#8718) add some info when there is an error.
* initial fix * apply suggestion * delete step_index line
* Add check for WindowsPath in to_json_string: on Windows, os.path.join returns a WindowsPath, which to_json_string did not convert to a string; added a check for WindowsPath to to_json_saveable
* Remove extraneous convert-to-string in test_check_path_types (tests/others/test_config.py)
* Fix style issues in tests/others/test_config.py
* Add unit test to test_config.py verifying that PosixPath and WindowsPath (depending on system) both work when converted to JSON
* Remove distinction between PosixPath and WindowsPath in ConfigMixin.to_json_string(); the conditional now tests for Path and uses Path.as_posix() to convert to a string
Co-authored-by: Vincent Dovydaitis <[email protected]>
Co-authored-by: Sayak Paul <[email protected]>
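The final shape of that fix can be illustrated with a minimal sketch. The helper name `to_json_saveable` follows the commit message, but the body here is an illustration, not the merged code: the key idea is to test for the common `PurePath` base class instead of distinguishing PosixPath from WindowsPath, and to normalize with `as_posix()` so the serialized string is identical on every OS.

```python
import json
from pathlib import PurePath, PurePosixPath, PureWindowsPath

def to_json_saveable(value):
    # PurePath covers Path, PosixPath, WindowsPath, and their "pure"
    # variants; as_posix() always emits forward slashes, so JSON output
    # does not depend on the platform the config was saved on.
    if isinstance(value, PurePath):
        return value.as_posix()
    return value
```

With this in place, `json.dumps(to_json_saveable(PureWindowsPath("C:\\models\\unet")))` produces a plain string instead of raising a serialization error.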
add pag sd3 --------- Co-authored-by: HyoungwonCho <[email protected]> Co-authored-by: Sayak Paul <[email protected]> Co-authored-by: crepejung00 <[email protected]> Co-authored-by: YiYi Xu <[email protected]> Co-authored-by: Aryan <[email protected]> Co-authored-by: Aryan <[email protected]>
* Fix loading sharded checkpoint when we have variant * add test * remote print --------- Co-authored-by: Sayak Paul <[email protected]>
…ce#9083) * update * update * update --------- Co-authored-by: Sayak Paul <[email protected]>
* txt2img PAG added
* AutoPipeline added, fixed case
* style
* apply suggestions
* added fast tests, added TODO tests
* revert dummy objects for Kolors
* fix PAG dummies
* fix test imports
* update PAG tests
* add Kolors PAG to docs
Co-authored-by: Sayak Paul <[email protected]>
* initial work draft for FreeNoise; needs massive cleanup
* fix FreeInit bug
* add AnimateDiff ControlNet implementation; revert attention changes
* add FreeNoise; remove old helper functions
* add decode batch size param to all pipelines
* make style; fix "copied from" comments; make fix-copies
* copy AnimateDiff ControlNet implementation from huggingface#8972
* add experimental support for num_frames not perfectly fitting context length, context stride
* make UNet motion model LoRA work again based on huggingface#8995
* copy load-video utils from huggingface#8972
* copied from AnimateDiff::prepare_latents
* address the case where the last batch of frames does not match the length of indices in prepare_latents
* decode_batch_size -> vae_batch_size; batch VAE encode support in AnimateDiff vid2vid
* revert SparseCtrl and SDXL FreeNoise changes; revert PIA
* add FreeNoise tests; add FreeNoise tests to AnimateDiff ControlNet
* improve docstrings; update tests
* Update src/diffusers/models/unets/unet_motion_model.py
* add FreeNoise to AnimateDiff PAG
* address review comments; fix error message; remove copied-from comment; fix imports in tests
Co-authored-by: Dhruv Nair <[email protected]>
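The "num_frames not perfectly fitting context length" case above amounts to choosing sliding windows of frame indices. A minimal sketch of that windowing, assuming a function name and clamping strategy of my own (the merged code may differ): when the final stride would run past the clip, the last window is pulled back so it still spans a full context length.

```python
def context_windows(num_frames: int, context_length: int, context_stride: int):
    # Short clip: a single window covering every frame.
    if num_frames <= context_length:
        return [list(range(num_frames))]
    windows = []
    for start in range(0, num_frames - context_length + context_stride, context_stride):
        end = min(start + context_length, num_frames)
        # Clamp the last window back so it still holds context_length frames.
        start = max(0, end - context_length)
        windows.append(list(range(start, end)))
        if end == num_frames:
            break
    return windows
```

For a 26-frame clip with context length 16 and stride 4, this yields overlapping windows starting at 0, 4, 8, and finally 10 (clamped from 12), so the tail frames are covered without a short window.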
* clipping for fp16 * fix typo * added fp16 inference to docs * fix docs typo * include link for fp16 investigation --------- Co-authored-by: Sayak Paul <[email protected]>
* allow sparsectrl to be loaded with single file * update --------- Co-authored-by: Dhruv Nair <[email protected]>
…#9110) * update * update
* add CogVideoX --------- Co-authored-by: Aryan <[email protected]> Co-authored-by: sayakpaul <[email protected]> Co-authored-by: Aryan <[email protected]> Co-authored-by: yiyixuxu <[email protected]> Co-authored-by: Steven Liu <[email protected]>
* toctree * fix
* fix for lr scheduler in distributed training * Fixed the recalculation of the total training step section * Fixed lint error --------- Co-authored-by: Sayak Paul <[email protected]>
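The total-training-step recalculation referenced in that fix is a common pitfall: the scheduler must be sized after gradient accumulation and the number of distributed processes are known, not from the raw dataset length. A hedged sketch of the arithmetic (the function name and argument names are illustrative, not from the PR):

```python
import math

def total_training_steps(num_samples: int, per_device_batch_size: int,
                         grad_accum_steps: int, num_processes: int,
                         num_epochs: int) -> int:
    # One optimizer step consumes batch * accum * processes samples,
    # so steps per epoch is the ceiling of the dataset size over that product.
    samples_per_step = per_device_batch_size * grad_accum_steps * num_processes
    steps_per_epoch = math.ceil(num_samples / samples_per_step)
    return steps_per_epoch * num_epochs
```

For example, 1000 samples on 2 processes with batch size 4 and 2 accumulation steps gives ceil(1000 / 16) = 63 steps per epoch, so 189 steps over 3 epochs; an LR scheduler built from the naive estimate would decay too slowly.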
Co-authored-by: Aryan <[email protected]>
* Add Differential Pipeline
* Fix Styling Issue using ruff -fix
* Add details to Contributing.md
* Revert "Fix Styling Issue using ruff -fix" (reverts commit d347de1)
* Revert the revert (reverts commit ce7c3ff)
* Revert README changes; restore and update README.md
* Resolve comments; fix README based on review
* Fix formatting after make style
Co-authored-by: Aryan <[email protected]>
Co-authored-by: YiYi Xu <[email protected]>
* initial commit - DreamBooth for Flux
* update transformer to be FluxTransformer2DModel
* update training loop and validation inference
* fix sd3 -> flux docs
* add guidance handling (not sure if it makes sense)
* initial DreamBooth LoRA commit
* fix text_ids in compute_text_embeddings; fix imports of static methods
* fix pipeline loading in README; remove auto1111 docs for now; remove some irrelevant text_encoder_3 refs
* Update examples/dreambooth/train_dreambooth_flux.py
* fix te2 loading and remove te2 refs from text encoder training; fix tokenizer_2 initialization
* remove text_encoder training refs from the LoRA script (for now)
* try with VAE in bfloat16; fix model hook save; fix tokenization; fix static imports; fix CLIP import
* fix minor bug in encode_prompt; add guidance def in LoRA script
* fix unpack_latents args; fix license in README
* add "none" to weighting_scheme options for uniform sampling
* adapt model saving and loading - remove text encoder refs
* initial commit for README; update weighting scheme default and docs
* add text_encoder training to the LoRA script; change vae_scale_factor value in both
* text encoder training fixes; update README; minor fixes; fix te params
Co-authored-by: Bagheera <[email protected]>
Co-authored-by: Sayak Paul <[email protected]>
…ingface#9010) * Fix textual inversion SDXL and add support for 2nd text encoder Signed-off-by: Daniel Socek <[email protected]> * Fix style/quality of text inv for sdxl Signed-off-by: Daniel Socek <[email protected]> --------- Signed-off-by: Daniel Socek <[email protected]> Co-authored-by: Sayak Paul <[email protected]>
* resolve peft links * fuse_lora
# Conflicts: # src/diffusers/__init__.py
Semantic sd3
Add SEGA to AuraFlow
suggest to first bump main before merging
add sega implementation for FLUX and HunyuanDiT
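At its core, SEGA (semantic guidance) extends classifier-free guidance with extra noise-estimate terms, one per edit concept, each pushed toward or away from the base estimate. The sketch below shows only that combination step in NumPy; it is a simplified illustration (the function name is mine, and SEGA's warmup schedule and per-element thresholding/masking are deliberately omitted), not the pipeline code this PR merges for the DiT models.

```python
import numpy as np

def sega_noise_estimate(eps_uncond, eps_cond, eps_edits,
                        guidance_scale, edit_scales, edit_signs):
    # Start from standard classifier-free guidance ...
    eps = eps_uncond + guidance_scale * (eps_cond - eps_uncond)
    # ... then add one guidance term per edit concept; sign +1 pushes the
    # estimate toward the concept, -1 pushes it away.
    for eps_edit, scale, sign in zip(eps_edits, edit_scales, edit_signs):
        eps = eps + sign * scale * (eps_edit - eps_uncond)
    return eps
```

The same combination applies whether the noise estimates come from a UNet or from the DiT backbones (SD3, AuraFlow, HunyuanDiT, FLUX) targeted here; only the model producing `eps_cond` and `eps_edits` changes.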