
Conversation

@stephenyan1231 (Contributor)

Summary

For video model evaluation, we sample N clips from a video, and average clip predictions to get a video-level prediction.
Assume we sample 2 clips per video. The test dataset, which has 4 videos {A, B, C, D}, is illustrated below.

[A_0, A_1, B_0, B_1, C_0, C_1, D_0, D_1]

Assume we have 2 GPUs. The existing DistributedSampler will distribute clips from the same video to different GPUs, making it difficult to average clip predictions.

GPU 0: 

       [A_0, B_0, C_0, D_0]

GPU 1: 

       [A_1, B_1, C_1, D_1]
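
For reference, a minimal sketch of the round-robin assignment that produces this split (the clip names are illustrative; this is not the sampler's actual code):

```python
# Round-robin sharding: index i goes to rank i % num_replicas,
# so the two clips of each video end up on different GPUs.
clips = ["A_0", "A_1", "B_0", "B_1", "C_0", "C_1", "D_0", "D_1"]
num_replicas = 2  # number of GPUs

for rank in range(num_replicas):
    shard = clips[rank::num_replicas]
    print(f"GPU {rank}: {shard}")
# GPU 0: ['A_0', 'B_0', 'C_0', 'D_0']
# GPU 1: ['A_1', 'B_1', 'C_1', 'D_1']
```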

We extend DistributedSampler to support an optional argument group_size. With group_size=2, clips are sharded as below.

GPU 0: 

        [A_0, A_1, B_0, B_1]

GPU 1: 

        [C_0, C_1, D_0, D_1]

This facilitates the averaging of clip predictions.
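
The idea behind group_size can be sketched as sharding at the granularity of whole groups (videos) rather than individual clips. The following is an illustrative sketch that reproduces the split above, not the library's implementation:

```python
# Group-aware sharding: split the clip list into per-video groups,
# give each rank a contiguous block of groups, then flatten back to clips.
clips = ["A_0", "A_1", "B_0", "B_1", "C_0", "C_1", "D_0", "D_1"]
num_replicas = 2
group_size = 2  # clips sampled per video

groups = [clips[i:i + group_size] for i in range(0, len(clips), group_size)]
groups_per_rank = len(groups) // num_replicas
for rank in range(num_replicas):
    block = groups[rank * groups_per_rank:(rank + 1) * groups_per_rank]
    shard = [clip for group in block for clip in group]
    print(f"GPU {rank}: {shard}")
# GPU 0: ['A_0', 'A_1', 'B_0', 'B_1']
# GPU 1: ['C_0', 'C_1', 'D_0', 'D_1']
```

With all clips of a video on one rank, the video-level score becomes a purely local average over group_size consecutive clip predictions, e.g. something like `clip_logits.view(-1, group_size, num_classes).mean(dim=1)`.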

Unit test

python test/test_datasets_samplers.py
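
Assuming the new argument lands as group_size on the DistributedSampler defined in torchvision/datasets/samplers/clip_sampler.py (the file touched by this PR), usage for the toy dataset above would look roughly like the sketch below; the import path and keyword names are inferred from the commit message and may not match the merged signature exactly.

```python
from torchvision.datasets.samplers import DistributedSampler  # import path assumed

# Toy dataset standing in for 4 videos x 2 clips each.
clips = ["A_0", "A_1", "B_0", "B_1", "C_0", "C_1", "D_0", "D_1"]

for rank in range(2):
    sampler = DistributedSampler(clips, num_replicas=2, rank=rank,
                                 shuffle=False, group_size=2)
    print(f"GPU {rank}: {[clips[i] for i in sampler]}")
```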

@codecov-io commented Oct 22, 2019

Codecov Report

Merging #1512 into master will increase coverage by 0.31%.
The diff coverage is 100%.


@@            Coverage Diff             @@
##           master    #1512      +/-   ##
==========================================
+ Coverage   64.34%   64.66%   +0.31%     
==========================================
  Files          83       83              
  Lines        6454     6461       +7     
  Branches      992      992              
==========================================
+ Hits         4153     4178      +25     
+ Misses       2006     1984      -22     
- Partials      295      299       +4
Impacted Files                                   Coverage Δ
torchvision/datasets/samplers/clip_sampler.py    79.54% <100%> (+23.98%) ⬆️

Continue to review full report at Codecov.

Legend
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 937c83a...916801c.

@fmassa (Member) left a comment


LGTM, thanks a lot for the PR Zhicheng!

@fmassa fmassa merged commit 355e9d2 into pytorch:master Oct 22, 2019
@fmassa fmassa mentioned this pull request Oct 31, 2019
fmassa pushed a commit that referenced this pull request Oct 31, 2019
* extend DistributedSampler to support group_size

* Fix lint
