
[diffusers]: Model Cache #1777


Description

@keturn

Model loading is significantly different with diffusers, and I'm not sure how best to integrate it with the existing ModelCache:

```python
class ModelCache(object):
    def __init__(self, config: OmegaConf, device_type: str, precision: str, max_loaded_models=DEFAULT_MAX_MODELS):
        '''
        Initialize with the path to the models.yaml config file,
        the torch device type, and precision. The optional
        min_avail_mem argument specifies how much unused system
        (CPU) memory to preserve. The cache of models in RAM will
        grow until this value is approached. Default is 2G.
        '''
```

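For contrast, loading on the diffusers side mostly goes through `from_pretrained`. Here's a minimal sketch of what that looks like; the model id and keyword arguments are illustrative only, not what our code actually passes:

```python
# Minimal sketch of diffusers-style loading, for contrast with ModelCache above.
# The model id and kwargs here are illustrative, not taken from InvokeAI's code.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",  # hub id or a local directory with model_index.json
    torch_dtype=torch.float16,         # roughly the "precision" argument above
)
pipe = pipe.to("cuda")                 # roughly the "device_type" argument above
```
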
diffusers takes advantage of 🤗accelerate by default. I don't know much about that library, but it looks like the sort of "offload this model state to CPU until we need it again" behavior ModelCache implements is already available there: Dispatching and Offloading Models. I hope this means we can drop a lot of the existing code from ModelCache.
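
If we do lean on accelerate directly, I'd guess the integration looks roughly like the untested sketch below. `cpu_offload` is from the accelerate docs linked above; which sub-modules to wrap, and whether this plays nicely with the rest of the pipeline, is exactly the part I'd want someone who knows accelerate to work out.

```python
# Untested sketch of accelerate's hook-based offloading on a diffusers pipeline.
# cpu_offload() keeps a module's weights on CPU and streams them to the
# execution device for each forward pass, which is roughly what ModelCache's
# offload-to-CPU path tries to do by hand.
import torch
from accelerate import cpu_offload
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
)

device = torch.device("cuda")
# Guessing at the sub-modules worth offloading; this is not settled design.
for submodule in (pipe.unet, pipe.text_encoder, pipe.vae):
    cpu_offload(submodule, execution_device=device)
```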

I haven't been using ModelCache's "offload to CPU" functionality even on the main branch, because it always took far more memory than I expected and quickly summoned the Out-Of-Memory Killer.

I do have fast storage and I don't have a ton of spare RAM, so I don't think I'm the target audience for the model caching/offloading feature and I need to delegate the ModelCache/diffusers integration to someone who properly appreciates it.

Some of this potential integration with accelerate could probably even be done on the main branch. But since #1583 already changes the ModelCache file a fair bit, and the way we interact with accelerate probably differs somewhat with and without diffusers (diffusers already sets it up to some degree), I expect a PR for this should target dev/diffusers rather than main.
