-
Notifications
You must be signed in to change notification settings - Fork 3.6k
Closed
Labels
Description
When using the WandbLogger in DDP mode with n>1 GPUs, n runs in Weights&Biases will be created. Only the first run will have the actual logs, all other runs will be empty, essentially cluttering up the W&B project.
Expected behavior
Only one run will be created, even when using DDP on multiple GPUs.
Environment
Example config, but this issue also happens on other configurations with different GPU models:
- CUDA:
- GPU:
- NVIDIA A100-SXM-80GB
- NVIDIA A100-SXM-80GB
- available: True
- version: 11.0
- GPU:
- Packages:
- numpy: 1.22.0
- pyTorch_debug: False
- pyTorch_version: 1.10.0
- pytorch-lightning: 1.5.8
- tqdm: 4.62.3
- System:
- OS: Linux
- architecture:
- 64bit
- ELF
- processor: x86_64
- python: 3.9.9
- version: # 91-Ubuntu SMP Thu Jul 15 19:09:17 UTC 2021
Additional Info
Since this bug only happens in DDP with multiple GPUs, I can't reproduce it in Colab.
SteshinSS