WandbLogger in DDP setting creates multiple runs in W&B #11380

@konstantinjdobler

Description

When using the WandbLogger in DDP mode with n > 1 GPUs, n runs are created in Weights & Biases. Only the first run contains the actual logs; all other runs are empty and clutter the W&B project.

Expected behavior

Only one run should be created, even when using DDP on multiple GPUs.
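
To make the expected behavior concrete, here is a minimal sketch of rank-zero gating, where only the process with rank 0 creates a run. It is not Lightning's or W&B's actual implementation: `global_rank` and `init_on_rank_zero` are hypothetical helper names, `init_fn` stands in for whatever creates the run (for example a call to `wandb.init`), and it assumes the launcher exports `RANK` or `LOCAL_RANK` for each process.

```python
import os
from typing import Callable, Mapping, Optional, TypeVar

T = TypeVar("T")


def global_rank(env: Optional[Mapping[str, str]] = None) -> int:
    """Return this process's rank, defaulting to 0 outside distributed runs.

    Assumption: the launcher exports RANK or LOCAL_RANK for every process.
    """
    env = os.environ if env is None else env
    for key in ("RANK", "LOCAL_RANK"):
        if key in env:
            return int(env[key])
    return 0


def init_on_rank_zero(
    init_fn: Callable[[], T], env: Optional[Mapping[str, str]] = None
) -> Optional[T]:
    """Call init_fn only on rank 0; every other rank gets None."""
    if global_rank(env) == 0:
        return init_fn()
    return None


if __name__ == "__main__":
    # Simulate two DDP processes; `created` stands in for runs in a W&B project.
    created = []
    for rank in ("0", "1"):
        init_on_rank_zero(lambda: created.append("run"), env={"LOCAL_RANK": rank})
    print(len(created))  # one run, not two
```

The sketch checks `RANK` before `LOCAL_RANK` because in a multi-node job every node has a process with local rank 0, so gating on the local rank alone would still create one run per node.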

Environment

Example configuration; the issue also occurs on other configurations with different GPU models:

  • CUDA:
    • GPU:
      • NVIDIA A100-SXM-80GB
      • NVIDIA A100-SXM-80GB
    • available: True
    • version: 11.0
  • Packages:
    • numpy: 1.22.0
    • pyTorch_debug: False
    • pyTorch_version: 1.10.0
    • pytorch-lightning: 1.5.8
    • tqdm: 4.62.3
  • System:
    • OS: Linux
    • architecture:
      • 64bit
      • ELF
    • processor: x86_64
    • python: 3.9.9
    • version: #91-Ubuntu SMP Thu Jul 15 19:09:17 UTC 2021

Additional Info

Since this bug only happens in DDP with multiple GPUs, I can't reproduce it in Colab.

cc @awaelchli @morganmcg1 @AyushExel @borisdayma @scottire

Labels: bug, logger: wandb
