-
Notifications
You must be signed in to change notification settings - Fork 3.6k
add LSF support #4966
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add LSF support #4966
Conversation
|
Thanks!! Looks good, would you be able to confirm it works on your end here when you can? |
We need to do this so we don't send the model to the same device from multiple ranks
justusschock
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is there a way to test this. I'm not fond to add anything to the Codename we cannot Test somehow in ci, since we are currently improving on that.
I think you would need access to a system that's managed by LSF. How does SLURM support get tested? |
|
Hello @ajtritt! Thanks for updating this PR.
Comment last updated at 2020-12-12 00:51:01 UTC |
|
Will open a new PR |
Any reason why this was closed? =) |
The final set of changes are more than the original PR stated. I could have edited the PR description, but that's a silent action and I didn't want the changes to be missed by reviewers. |
|
I think it would be completely fine to edit the description, reset all reviews, and comment that you extend the scope :] |
What does this PR do?
Add support for running PyTorch Lightning on systems managed by LSF.
Before submitting
PR review
Anyone in the community is free to review the PR once the tests have passed.
Before you start reviewing make sure you have read Review guidelines. In short, see the following bullet-list:
Did you have fun?
Always.