new OffloadingDevice loads one model at a time, on demand #2596
Merged
21 commits
0cfebdf new OffloadingDevice loads one model at a time, on demand (keturn)
79c454a Merge remote-tracking branch 'origin/main' into spike/offloading-device (keturn)
4a9b0fc fixup! new OffloadingDevice loads one model at a time, on demand (keturn)
69873d9 fix(prompt_to_embeddings): call the text encoder directly instead of … (keturn)
9d5ab9e more attempts to get things on the right device from the offloader (keturn)
f39c806 more attempts to get things on the right device from the offloader (keturn)
337d179 Merge remote-tracking branch 'origin/main' into spike/offloading-device (keturn)
20df847 make offloading methods an explicit part of the pipeline interface (keturn)
f3e03e4 Merge remote-tracking branch 'origin/main' into spike/offloading-device (keturn)
ac0746f inlining some calls where device is only used once (keturn)
127c1b8 ensure model group is ready after pipeline.to is called (keturn)
52563ae Merge remote-tracking branch 'origin/main' into spike/offloading-device (keturn)
36bbb09 Merge remote-tracking branch 'origin/main' into spike/offloading-device (keturn)
26444af fixup! Strategize slicing based on free [V]RAM (#2572) (keturn)
42ee1c6 doc(offloading): docstrings for offloading.ModelGroup (keturn)
0dcfb6f doc(offloading): docstrings for offloading-related pipeline methods (keturn)
ae73997 Merge remote-tracking branch 'origin/main' into spike/offloading-device (keturn)
4a28326 refactor(offloading): s/SimpleModelGroup/FullyLoadedModelGroup (keturn)
10547e4 refactor(offloading): s/HotSeatModelGroup/LazilyLoadedModelGroup (keturn)
b4355a1 Merge branch 'main' into spike/offloading-device (damian0815)
dca5561 Merge branch 'main' into spike/offloading-device (lstein)
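For readers skimming the commit list, the idea in the PR title (keep only one model on the execution device at a time, loading each on demand) can be illustrated with a minimal sketch. This is not the code from this PR: `FakeModel` and `OneAtATimeGroup` are hypothetical names invented here, the device names are plain strings, and the real `offloading.ModelGroup` classes may work quite differently.

```python
class FakeModel:
    """Hypothetical stand-in for a model; it only tracks which device it is on."""

    def __init__(self, name):
        self.name = name
        self.device = "cpu"

    def to(self, device):
        # Mimics the chaining style of a `.to(device)` call.
        self.device = device
        return self


class OneAtATimeGroup:
    """Hypothetical group that keeps at most one model on the execution device."""

    def __init__(self, execution_device, offload_device="cpu"):
        self.execution_device = execution_device
        self.offload_device = offload_device
        self._current = None  # the model currently on the execution device

    def load(self, model):
        """Move `model` to the execution device, evicting the previous one first."""
        if self._current is model:
            return model
        if self._current is not None:
            self._current.to(self.offload_device)
        model.to(self.execution_device)
        self._current = model
        return model

    def offload_current(self):
        """Send whatever is loaded back to the offload device."""
        if self._current is not None:
            self._current.to(self.offload_device)
            self._current = None


group = OneAtATimeGroup("cuda")  # "cuda" is only a label in this sketch
text_encoder, unet = FakeModel("text_encoder"), FakeModel("unet")
group.load(text_encoder)  # text_encoder moves to the execution device
group.load(unet)          # text_encoder is evicted, unet takes its place
```

The trade-off in a design like this is lower peak device memory in exchange for extra transfer time whenever the active model changes.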