@ananthsub ananthsub commented Feb 7, 2022

What does this PR do?

  • Enables automatic hardware selection without duplicating code across the Trainer and individual accelerator implementations. This can be further enhanced by adding an AcceleratorRegistry: the accelerator connector could iterate through all registered accelerators, call acc_cls.is_available(), and immediately determine which hardware to use.

This aims to simplify the accelerator connector logic and the rewrite effort in #11448.
It also moves the hardcoded, duplicated assertion logic from the Trainer constructor checks to a single runtime check at trainer.fit/validate/test/predict calls.

Given the discussion on this PR, we can decide where the assertion on device availability should happen: in the accelerator's __init__, in setup_environment, or left up to individual accelerators to decide.
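The registry idea above can be sketched as follows. This is an illustrative mock, not Lightning's actual implementation: the class names, the `REGISTRY` list, and `select_accelerator` are assumptions for the sake of the example; the key point is that each accelerator exposes a static `is_available()` that the connector can query.

```python
from abc import ABC, abstractmethod


class Accelerator(ABC):
    """Illustrative base class (not Lightning's real Accelerator)."""

    @staticmethod
    @abstractmethod
    def is_available() -> bool:
        """Return True if this accelerator's hardware is detected."""


class CPUAccelerator(Accelerator):
    @staticmethod
    def is_available() -> bool:
        return True  # a CPU is always present


class CUDAAccelerator(Accelerator):
    @staticmethod
    def is_available() -> bool:
        # Hypothetical check; a real implementation would query torch.cuda.
        import torch
        return torch.cuda.is_available()


# Registry-style lookup, as suggested above: iterate the registered
# accelerators and return the first whose hardware is available.
REGISTRY = [CUDAAccelerator, CPUAccelerator]


def select_accelerator():
    for acc_cls in REGISTRY:
        try:
            if acc_cls.is_available():
                return acc_cls
        except ImportError:
            continue  # dependency for this accelerator is not installed
    raise RuntimeError("No available accelerator found")
```

With this shape, the connector no longer needs hardcoded per-backend availability checks; adding a new accelerator only requires implementing `is_available()` and registering the class.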

Fixes #11818

Does your PR introduce any breaking changes? If yes, please list them.

Yes, this now:

  • raises a TypeError if a custom accelerator has not implemented the new abstract method
  • raises a RuntimeError if the configured hardware is not available during Trainer execution
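A minimal sketch of where these two errors come from, assuming the new interface is declared with abstractmethod (class and function names here are illustrative, not Lightning's API): Python's ABC machinery raises the TypeError automatically when a subclass missing the abstract method is instantiated, while the RuntimeError would come from an explicit availability check performed at execution time.

```python
from abc import ABC, abstractmethod


class Accelerator(ABC):
    """Illustrative base class; names are assumptions, not Lightning's API."""

    @staticmethod
    @abstractmethod
    def is_available() -> bool:
        """Return True if this accelerator's hardware can be used."""


class MyAccelerator(Accelerator):
    # A custom accelerator that forgot to implement is_available:
    # instantiating it now fails.
    pass


try:
    MyAccelerator()
except TypeError as err:
    # Raised automatically by Python's ABC machinery.
    print(f"TypeError: {err}")


def check_availability(acc_cls) -> None:
    # Hypothetical runtime check, as would run at fit/validate/test/predict.
    if not acc_cls.is_available():
        raise RuntimeError(f"{acc_cls.__name__} hardware is not available")
```

Because the TypeError fires at instantiation rather than deep inside a run, custom-accelerator authors find out about the missing method immediately.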

Before submitting

  • Was this discussed/approved via a GitHub issue? (not for typos and docs)
  • Did you read the contributor guideline, Pull Request section?
  • Did you make sure your PR does only one thing, instead of bundling different changes together?
  • Did you make sure to update the documentation with your changes? (if necessary)
  • Did you write any new necessary tests? (not for typos and docs)
  • Did you verify new and existing tests pass locally with your changes?
  • [n/a] Did you list all the breaking changes introduced by this pull request?
  • Did you update the CHANGELOG? (not for typos, docs, test updates, or internal minor changes/refactorings)

PR review

Anyone in the community is welcome to review the PR.
Before you start reviewing, make sure you have read the Review guidelines. In short, check the following:

  • Is this pull request ready for review? (if not, please submit in draft mode)
  • Check that all items from Before submitting are resolved
  • Make sure the title is self-explanatory and the description concisely explains the PR
  • Add labels and milestones (and optionally projects) to the PR so it can be classified

Did you have fun?

Make sure you had fun coding 🙃

@carmocca carmocca added this to the 1.6 milestone Feb 7, 2022
@ananthsub ananthsub changed the title Add assertions to GPU accelerator for CUDA availability Add assertions to GPU accelerator for device availability Feb 7, 2022
@mergify mergify bot added the ready PRs ready to be merged label Feb 7, 2022
@mergify mergify bot added the has conflicts label Feb 7, 2022
@ananthsub ananthsub force-pushed the feat/gpu-validation branch from e2dcbfe to 9cc7c00 Compare February 9, 2022 09:48
@mergify mergify bot removed the has conflicts label Feb 9, 2022
rohitgr7 previously approved these changes Feb 9, 2022

carmocca commented Feb 9, 2022

Does this close #11799 and #11798?

@rohitgr7 rohitgr7 dismissed their stale review February 9, 2022 18:39

just need some clarifications

@ananthsub (Contributor, Author) replied:
Does this close #11799 and #11798?

Yes, I will close them after this is merged

@ananthsub ananthsub merged commit 1b107c5 into Lightning-AI:master Feb 9, 2022
@ananthsub ananthsub deleted the feat/gpu-validation branch February 9, 2022 23:11

Labels

  • accelerator: cuda — Compute Unified Device Architecture GPU
  • breaking change — Includes a breaking change
  • ready — PRs ready to be merged


Development

Successfully merging this pull request may close these issues.

[RFC] Add Accelerator.is_available() interface requirement

7 participants