-
Notifications
You must be signed in to change notification settings - Fork 3.6k
Closed
Labels
bugSomething isn't workingSomething isn't workingdata handlingGeneric data-related topicGeneric data-related topichelp wantedOpen to be worked onOpen to be worked on
Milestone
Description
🐛 Bug
The LightningDataModule prepare_data function should be able to be called on the local rank 0 process on any node if the user chooses, and there are APIs around controlling this, but they don't work. I've reproduced this bug and traced it to one (rather obvious) line:
This line wraps the prepare_data function in rank_zero_only, a function which means global zero not local zero. There is already logic in the data connector around making sure that the function is only run when it is supposed to. Apparently the multi-node setup was never tested?
This fix is simple: just delete the rank_zero_only part.
Additional context
This is a quick one-line fix that I'm happy to PR, but someone needs to engage with me on it so that it actually gets merged.
ananthsub and priancho
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't workingdata handlingGeneric data-related topicGeneric data-related topichelp wantedOpen to be worked onOpen to be worked on