Conversation

keturn (Contributor) commented Feb 9, 2023

Trying something different in place of #2542.

Fixes #2326.

keturn (Contributor, Author) commented Feb 9, 2023

The accelerate feature the previous PR used does a terribly clever thing: parameters stay defined on the `meta` device until their module's `forward` method is called.

That often resulted in `foo.device` returning `meta`, which is useless for preparing tensors you want to pass to that model, but at least it's obviously useless upon inspection.

In contrast, having models shuffled off to the CPU means `foo.device` is never the unusable `meta`. But it also means the `foo.device` you see now isn't necessarily what `foo.device` will be when you call `foo()`, so we still have to be cautious about relying on `.device`.
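The contrast can be sketched with plain torch (illustrative, not this PR's code): under CPU offload, a module's device property reports `cpu` between calls even though an offloading hook may move the weights elsewhere for execution.

```python
import torch

# Minimal sketch (not this PR's code) of why `.device` is unreliable under
# offloading: a module reports where its parameters sit *now*, not where
# they will be when forward() actually runs.

class TinyModel(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(4, 4)

    @property
    def device(self) -> torch.device:
        # The common convention: report the first parameter's device.
        return next(self.parameters()).device

model = TinyModel().to("cpu")   # parked on CPU between invocations
print(model.device)             # cpu -- but an offloading hook could move
                                # the weights to the GPU inside forward()
```

So any tensor prepared with `model.device` before the call may land on the wrong device by the time the model actually executes.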

keturn (Contributor, Author) commented Feb 9, 2023

It's minimally working now, tested for txt2img in text mode only.

There are still a few other things to clean up; e.g. the web UI runs into an error in `lowres_estimated_image`.

keturn (Contributor, Author) commented Feb 9, 2023

Added a workaround for the estimated-image function, then moved on to testing inpainting, and that has its own bucket of problems.

I assumed `vae.decode` was the problem because its `forward` method was `encode`, making `decode` the "extra" method, but no: its `forward` does both for some reason, so neither `encode` nor `decode` gets the hooks.
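A toy illustration of that hook behavior (hypothetical module, not the real VAE): `register_forward_hook` only fires when the module is invoked through `__call__`, so calling a method like `decode` directly bypasses the hook entirely.

```python
import torch

# Toy stand-in for the VAE; `decode` here is hypothetical.
class TinyVAE(torch.nn.Module):
    def forward(self, x):
        return x

    def decode(self, x):
        # Called directly, this never goes through __call__, so forward
        # hooks (accelerate-style or otherwise) do not fire.
        return x

vae = TinyVAE()
fired = []
vae.register_forward_hook(lambda mod, args, out: fired.append(True))

vae(torch.ones(1))          # goes through __call__ -> hook fires
vae.decode(torch.ones(1))   # direct method call -> hook does NOT fire
print(len(fired))           # 1
```

This is why hooks attached only to `forward` leave the extra entry points unmanaged.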

And then it has the usual set of "tensors prepared using `vae.device` end up on `cpu`" problems, which we have elsewhere too, but they're multiplied here because inpainting deals with more inputs (input image, mask, etc.).

I'm coming to believe we can't do this completely transparently, with the Pipeline class unaware of how the model devices are managed. It needs a way to discover the models' actual execution device, whether by inquiring with an Offloader instance or through some property we add to the models.
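One way to sketch that idea (the names `Offloader`, `install`, and `device_for` are illustrative, not this PR's final API): the pipeline asks the offloader where a model will run, instead of trusting the model's current parameter device.

```python
import torch

class Offloader:
    """Illustrative offloader: parks models on CPU and remembers where
    they will actually execute."""

    def __init__(self, execution_device: torch.device):
        self.execution_device = execution_device
        self._managed = set()

    def install(self, model: torch.nn.Module) -> None:
        self._managed.add(model)
        model.to("cpu")  # offloaded until its forward pass is needed

    def device_for(self, model: torch.nn.Module) -> torch.device:
        # The device the model will run on, regardless of where its
        # parameters sit right now.
        assert model in self._managed
        return self.execution_device

offloader = Offloader(torch.device("cuda"))
model = torch.nn.Linear(2, 2)
offloader.install(model)

# The pipeline prepares tensors for the *execution* device,
# not for wherever the parameters currently live:
print(offloader.device_for(model))      # cuda
print(next(model.parameters()).device)  # cpu
```

The key design point is that device knowledge lives with whatever manages the offloading, so the pipeline never has to guess from a parameter that may be parked on CPU.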

keturn (Contributor, Author) commented Feb 10, 2023

This is starting to shape up now, and using it doesn't require nearly as many awkward workarounds as the previous attempt.

It's working in many cases, but my troubleshooting is currently impeded by something that's swallowing all the console output.

keturn marked this pull request as ready for review February 15, 2023 04:30
damian0815 (Contributor) commented Feb 16, 2023

I hit resolve on both of those conversations; thanks for making the naming change.

damian0815 self-requested a review February 16, 2023 20:02
damian0815 enabled auto-merge (squash) February 16, 2023 20:03
lstein (Collaborator) left a comment

Neat! Very elegant solution.

damian0815 merged commit 8a0d45a into main Feb 16, 2023
damian0815 deleted the spike/offloading-device branch February 16, 2023 23:48


Development

Successfully merging this pull request may close these issues.

[bug]: free_gpu_mem does not offload diffusers models
