[video dataset] expose more arguments of VideoClips in video datasets #1310
Thanks for the PR!
I've made some inline comments; let me know what you think.
torchvision/datasets/hmdb51.py
Outdated
def __init__(self, root, annotation_path, frames_per_clip, step_between_clips=1,
             fold=1, train=True, transform=None):
             frame_rate=None, precomputed_metadata=None, precomputed_metadata_filepath=None,
I would be OK with exposing _precomputed_metadata as a private argument in the constructor for now, and also exposing frame_rate, but I think that precomputed_metadata_filepath and save_metadata_filepath should live outside of the dataset and be handled by the training code.
I agree. @fmassa
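The split suggested in this thread, where the dataset only accepts precomputed metadata and the training code decides where that metadata is stored, might look roughly like the sketch below. It is not code from this PR: load_or_build_metadata, fake_scan, the dictionary keys and the cache file name are all hypothetical, and pickle is just one possible serialization choice.

```python
import os
import pickle
import tempfile


def load_or_build_metadata(cache_path, build_fn):
    """Return (metadata, loaded_from_cache).

    `build_fn` is a hypothetical zero-argument callable standing in for
    "construct the dataset and read its metadata". The file handling lives
    here, in the training code, rather than inside the dataset class.
    """
    if os.path.exists(cache_path):
        with open(cache_path, "rb") as f:
            return pickle.load(f), True
    metadata = build_fn()
    with open(cache_path, "wb") as f:
        pickle.dump(metadata, f)
    return metadata, False


def fake_scan():
    # Stand-in for the expensive scan over all video files (made-up values).
    return {"video_paths": ["a.avi", "b.avi"], "video_fps": [30.0, 25.0]}


cache_file = os.path.join(tempfile.mkdtemp(), "hmdb51_metadata.pkl")

# First call builds the metadata and writes the cache file.
first, first_cached = load_or_build_metadata(cache_file, fake_scan)
# Second call finds the file and skips the scan.
second, second_cached = load_or_build_metadata(cache_file, fake_scan)
```

With this arrangement the dataset constructor needs no file path arguments; the training script would pass the loaded dictionary through the private _precomputed_metadata argument.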
Codecov Report
@@ Coverage Diff @@
## master #1310 +/- ##
=========================================
Coverage ? 65.69%
=========================================
Files ? 75
Lines ? 5801
Branches ? 888
=========================================
Hits ? 3811
Misses ? 1723
Partials ? 267
One thing that bothers me with the current approach is that there is no validation whatsoever that the I see a few possibilities:
Thoughts?
Since both PR 1303 and the current PR 1310 modify common files such as
In the __init__() method of class VideoClips, two arguments, frame_rate and _precomputed_metadata, are not yet exposed in the dataset classes for hmdb51, ucf101 and kinetics400. This PR updates the dataset classes to expose those two arguments.
It also introduces VisionVideoDataset to abstract out the get_metadata() API. The caller can retrieve the video dataset metadata and decide how to cache it.
This depends on a previous PR ([video reader] inception commit #1303).
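As a rough illustration of the description above, the sketch below shows a dataset forwarding frame_rate and _precomputed_metadata to its clip index and exposing get_metadata() to the caller. FakeVideoClips and VideoDatasetSketch are hypothetical stand-ins written for this example; they do not decode video and are not the classes from this PR or from torchvision.

```python
class FakeVideoClips:
    """Hypothetical stand-in for VideoClips; it does not read any video."""

    def __init__(self, video_paths, frames_per_clip, step_between_clips=1,
                 frame_rate=None, _precomputed_metadata=None):
        self.frames_per_clip = frames_per_clip
        self.step_between_clips = step_between_clips
        self.frame_rate = frame_rate
        if _precomputed_metadata is None:
            # Placeholder for the expensive per-video scan.
            self.metadata = {"video_paths": list(video_paths)}
            self.scanned = True
        else:
            # Reuse what the caller supplied and skip the scan.
            self.metadata = _precomputed_metadata
            self.scanned = False


class VideoDatasetSketch:
    """Hypothetical base class that exposes get_metadata() to the caller."""

    def __init__(self, video_paths, frames_per_clip, step_between_clips=1,
                 frame_rate=None, _precomputed_metadata=None):
        # Forward the two newly exposed arguments to the clip index.
        self.video_clips = FakeVideoClips(
            video_paths,
            frames_per_clip,
            step_between_clips,
            frame_rate=frame_rate,
            _precomputed_metadata=_precomputed_metadata,
        )

    def get_metadata(self):
        return self.video_clips.metadata


paths = ["a.avi", "b.avi"]  # made-up file names
first = VideoDatasetSketch(paths, frames_per_clip=16, frame_rate=15)
metadata = first.get_metadata()

# A later run hands the metadata back in, so no rescan is needed.
second = VideoDatasetSketch(paths, frames_per_clip=16, frame_rate=15,
                            _precomputed_metadata=metadata)
```

How the caller stores the returned metadata between runs is deliberately left out of the dataset, in line with the review discussion.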