[video dataset]expose more arguments of VideoClips in video datasets #1310

stephenyan1231 · 2019-09-08T20:54:08Z

The __init__() method of class VideoClips accepts two arguments, frame_rate and _precomputed_metadata, that are not yet exposed in the dataset classes for hmdb51, ucf101 and kinetics400. This PR updates those dataset classes to expose both arguments.
It also adds a class VisionVideoDataset that abstracts out the get_metadata() API, so the caller can retrieve the video dataset metadata and decide how to cache it. This depends on a previous PR ([video reader] inception commit #1303).

fmassa

Thanks for the PR!

I've made some inline comments. Let me know what you think.

torchvision/datasets/hmdb51.py

fmassa · 2019-09-11T13:44:49Z


    def __init__(self, root, annotation_path, frames_per_clip, step_between_clips=1,
-                 fold=1, train=True, transform=None):
+                 frame_rate=None, precomputed_metadata=None, precomputed_metadata_filepath=None,


I would be ok exposing the _precomputed_metadata as a private argument in the constructor for now, and also expose the frame_rate, but I think that the precomputed_metadata_filepath and save_metadata_filepath should live outside of the dataset and be handled by the training code.

I agree. @fmassa
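
To illustrate the suggestion above that file handling should live in the training code, here is a minimal sketch of a caching helper. The helper name `load_or_compute_metadata`, the use of pickle, and the `compute_fn` callback are assumptions made for illustration; none of them are part of this PR or of torchvision.

```python
import os
import pickle


def load_or_compute_metadata(cache_path, compute_fn):
    """Hypothetical training-side helper (not part of torchvision).

    Returns the metadata stored at cache_path if that file exists;
    otherwise calls compute_fn(), stores its result, and returns it.
    """
    if os.path.exists(cache_path):
        with open(cache_path, "rb") as f:
            return pickle.load(f)

    metadata = compute_fn()

    # Write to a temporary file and rename it afterwards, so that an
    # interrupted run does not leave a truncated cache file behind.
    tmp_path = cache_path + ".tmp"
    with open(tmp_path, "wb") as f:
        pickle.dump(metadata, f)
    os.replace(tmp_path, cache_path)
    return metadata
```

Under these assumptions, the training script would pass a `compute_fn` that builds the dataset once and returns its metadata, then hand the result to the dataset constructor through the private _precomputed_metadata argument on later runs. The dataset itself never needs to know a file path.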

codecov-io · 2019-09-17T06:11:58Z

Codecov Report

❗ No coverage uploaded for pull request base (master@04f70c1).
The diff coverage is 50%.

@@            Coverage Diff            @@
##             master    #1310   +/-   ##
=========================================
  Coverage          ?   65.69%           
=========================================
  Files             ?       75           
  Lines             ?     5801           
  Branches          ?      888           
=========================================
  Hits              ?     3811           
  Misses            ?     1723           
  Partials          ?      267

| Impacted Files | Coverage Δ |
|---|---|
| torchvision/datasets/vision.py | `46.87% <42.85%> (ø)` |
| torchvision/datasets/ucf101.py | `24.44% <50%> (ø)` |
| torchvision/datasets/hmdb51.py | `27.08% <50%> (ø)` |
| torchvision/datasets/kinetics.py | `33.33% <60%> (ø)` |

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 04f70c1...caa4a18.

fmassa · 2019-09-17T19:34:19Z

One thing that bothers me with the current approach is that there is no validation whatsoever that the precomputed_metadata and the samples (which are computed from the files) actually match.

I see a few possibilities:

- Enrich the metadata field from Dataset to contain samples as well (or some function of them), and assert that they have the same number of elements as video_clips. Also make the argument start with an underscore, so _precomputed_metadata instead of precomputed_metadata.
- Pickle the Dataset entirely instead of adding support for constructing it from metadata, and leave it to the training code to validate that they actually match.
- Add a @classmethod called from_metadata that constructs the dataset from the metadata. This keeps the constructor simpler, but it also requires storing more information in the metadata, probably the root, classes and samples fields. In this case, the from_metadata classmethod would take as arguments metadata, transforms and maybe frames_per_clip and step_between_clips.

Thoughts?
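
As a rough sketch of the third option combined with the validation from the first, the toy class below stores root, samples and video paths in its metadata and rebuilds itself through a from_metadata classmethod. `ToyVideoDataset`, its metadata keys and its path-equality check are hypothetical stand-ins chosen for illustration; they are not the torchvision dataset classes or the VideoClips metadata format.

```python
class ToyVideoDataset:
    """Hypothetical stand-in for a video dataset (not torchvision code)."""

    # Assumed metadata layout, chosen only for this sketch.
    REQUIRED_KEYS = ("root", "samples", "video_paths")

    def __init__(self, root, samples, video_paths, frames_per_clip,
                 step_between_clips=1, transform=None):
        # samples is a list of (path, class_index) pairs.
        sample_paths = [path for path, _ in samples]
        # Validation: the clip metadata must describe exactly the
        # same files, in the same order, as the samples.
        if sample_paths != list(video_paths):
            raise ValueError("metadata does not match the dataset samples")
        self.root = root
        self.samples = list(samples)
        self.video_paths = list(video_paths)
        self.frames_per_clip = frames_per_clip
        self.step_between_clips = step_between_clips
        self.transform = transform

    def get_metadata(self):
        """Return everything needed to rebuild the dataset without rescanning."""
        return {
            "root": self.root,
            "samples": list(self.samples),
            "video_paths": list(self.video_paths),
        }

    @classmethod
    def from_metadata(cls, metadata, frames_per_clip,
                      step_between_clips=1, transform=None):
        """Alternative constructor that trusts metadata but still validates it."""
        missing = [key for key in cls.REQUIRED_KEYS if key not in metadata]
        if missing:
            raise KeyError("metadata is missing keys: {}".format(missing))
        return cls(metadata["root"], metadata["samples"],
                   metadata["video_paths"], frames_per_clip,
                   step_between_clips, transform)
```

Because from_metadata goes through the regular constructor, the consistency check runs on both construction paths. A real implementation would likely need a cheaper or stricter check than comparing path lists.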

stephenyan1231 · 2019-09-17T21:45:44Z

Since both PR 1303 and the current PR 1310 modify common files such as video_utils.py, I am closing PR 1310 and merging its changes into PR 1303 for easier development and code review. @fmassa

zyan3 added 2 commits September 8, 2019 13:51

[video dataset]expose more arguments of VideoClips in video datasets

2da4b63

add new arguments precomputed_metadata_filepath and save_metadata_fil…

42705eb

…epath to video datasets

fmassa requested changes Sep 11, 2019

zyan3 added 2 commits September 16, 2019 22:23

add VisionVideoDataset class. Address comments from fmassa

0364ec5

undo changes to video_utils.py

caa4a18

stephenyan1231 closed this Sep 17, 2019

stephenyan1231 reopened this Sep 17, 2019

stephenyan1231 closed this Sep 17, 2019
