Add examples to transform docs #1426

sotte · 2019-10-07T19:44:26Z

This PR is the result of #1409. @fmassa thanks for the input!

The PR adds examples of each transform class to the sphinx docs. The result looks something like this:

Make `_sample_image()` public

Also, the Grace Hopper image is available under test/assets I believe, which is not installed with torchvision, so this would mean that we would need to move the image somewhere else. My thinking is that those two functions are very specific to the examples, and thus having them live in torchvision.utils as being public functions might not be best.

I would argue that it's quite convenient to be able to get sample image and therefore sample_image() should be public. skimage for example gives you quite a few sample images which I actually tend to use when using pytorch.
(That being said I'm also fine with a private _sample_image().)

Implementation of `_sample_image()` public

The current implementation is just a dummy implementation that load the image from /tmp :)
The Hopper image is part of the test/ folder and is not part of the resulting egg/wheel/whatever.
Normally I would just use https://docs.python.org/3.7/library/importlib.html#module-importlib.resources to load the file, but pytorch is not 3.7 only :) So we have to decide on where to put the file and how to load it. I'm no expert when it comes to packaging and I'm more than open for suggestion.

TODOs

Add examples to transform classes (as suggested)
Add helpers to load and display images
Split transform.rst into transform.rst and transform_functional.rst (as suggested)
Decide if _sample_image() really should de private
Add proper implementation for _sample_image() depending on how the actual sample image is going to be shipped.
Fix inconsistencies of of documented transforms.
Document all __call__ of TensorTransforms.
Clean git history.

Feedback welcome!

Closes #1409

- All "Transforms on torch.*Tensor" document their __call__ in a consistent manner and __call__ is shown in sphinx. - Fixed typos, punctuation, and naming. - Fixed wrong format of doc strings.

codecov-io · 2019-10-14T08:21:35Z

Codecov Report

Merging #1426 into master will decrease coverage by 0.1%.
The diff coverage is 25%.

@@            Coverage Diff             @@
##           master    #1426      +/-   ##
==========================================
- Coverage   64.08%   63.97%   -0.11%     
==========================================
  Files          80       80              
  Lines        6328     6343      +15     
  Branches      973      975       +2     
==========================================
+ Hits         4055     4058       +3     
- Misses       1986     1998      +12     
  Partials      287      287

Impacted Files	Coverage Δ
torchvision/transforms/transforms.py	`81.04% <ø> (ø)`	⬆️
torchvision/utils.py	`55.88% <25%> (-10.16%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update ed5b2dc...7d6d38c. Read the comment docs.

fmassa

Thanks a lot for the PR @sotte!

I have a few comments, let me know what you think.

In particular, I'm thinking if we could just define the methods in the documentation somewhere, so that they do not live in the torchvision source.

torchvision/transforms/transforms.py

docs/source/transforms.rst

fmassa · 2019-10-18T13:00:54Z

Another thing, we should not forget to fix the random seeds (for both random and torch) to avoid the images being regenerated at every doc rebuild.

Report their input/output shapes.

sotte · 2019-10-19T08:14:05Z

Another thing, we should not forget to fix the random seeds (for both random and torch) to avoid the images being regenerated at every doc rebuild.

The examples are only re-executed if the example code changes, i.e. calling make html twice in a row does not rebuild the images. Therefore I'm not not sure if he even have to fix the seeds. @fmassa if only recreating the images on every make html is the issue, then we don't need to fix the seeds. What do you think.

sotte · 2019-10-19T08:20:16Z

We need to decide on the actual implementation of _example_image(), where the image is stored, and how the image should be packed into the wheel.

Just as reference: this is how skimage does it:

Packing: https://github.com/scikit-image/scikit-image/blob/master/setup.py#L160
Loading the images: https://github.com/scikit-image/scikit-image/blob/master/skimage/data/__init__.py#L78

fmassa · 2019-10-25T13:26:44Z

The examples are only re-executed if the example code changes, i.e. calling make html twice in a row does not rebuild the images

I might not understand how matplotlib caching works, but I would have expected that if I clean up the build folder and re-run again make html the images generated would be different? Which would mean the html would change everytime we update the documentation.

We need to decide on the actual implementation of _example_image(), where the image is stored, and how the image should be packed into the wheel.

thanks for the pointers. I'm not yet clear if we want to have those functions in the torchvision distribution, but the pointers are anyway very helpful.

I'm waiting on feedback from @jlin27 on what would be better.

jlin27 · 2019-10-25T21:51:07Z

Hi @fmassa @sotte

My vote is to keep _sample_image() private, given as you've all discussed, that it's specific for this example use case. Helps signal to users that they probably shouldn't be using it outside of this situation.

As for where to store the images, I'd suggest a static folder with a descriptive name. For example, in pytorch/tutorials, images are stored /static/_img (https://github.com/pytorch/tutorials/tree/master/_static/images), and just make sure to double-check that they are pulled in properly during build.

sotte · 2019-10-27T08:54:25Z

Seeds
@fmassa I think we were talking about different things. When you remove the build/ folder of the docs, of course the docs will be rebuild from scratch and the output might be different depending on the RNG. If you don't remove the build/ folder then sphinx is not going to re-execute the examples.
I set the random seed now and the examples should stay constant between complete rebuilds.

Private functions
I explicitly mention that the functions are not to be used.

Loading and packaging the image / things that should be simple but are quite tricky
@jlin27 thanks for the input! I think bundling the images is a bit more complex as in the pytorch tutorials case. We have to ship the assets.
In general I would add the image to a newly created torchvision.assets module so that it gets shipped with torchvision and so that I can access the file similar to how it's done with skimage. Something like this:

def _sample_image():
    """Private helper function to load a sample PIL image.

    This function might change and/or break. Don't depend on it.
    """
    import os.path
    from PIL import Image

    data_dir = os.path.abspath(os.path.dirname(__file__))
    return Image.open(os.path.join(data_dir, "assets", "grace_hopper_517x606.jpg"))

Then of course we have to actually include the folder

# MANIFEST.in
include README.rst
include LICENSE
include torchvision/assets/*

recursive-exclude * __pycache__
recursive-exclude * *.py[co]

and change the

# setup.py
setup(
    ...
    zip_safe=False,  # not zip safe because we load the image via a path
    include_package_data=True,
)

but after doing so I get the following error:

➤ python setup.py install 
Building wheel torchvision-0.5.0a0+5108c1e
running install
running bdist_egg
running egg_info
writing torchvision.egg-info/PKG-INFO
writing dependency_links to torchvision.egg-info/dependency_links.txt
writing requirements to torchvision.egg-info/requires.txt
writing top-level names to torchvision.egg-info/top_level.txt
reading manifest file 'torchvision.egg-info/SOURCES.txt'
reading manifest template 'MANIFEST.in'
warning: no previously-included files matching '__pycache__' found under directory '*'
warning: no previously-included files matching '*.py[co]' found under directory '*'
writing manifest file 'torchvision.egg-info/SOURCES.txt'
installing library code to build/bdist.linux-x86_64/egg
running install_lib
running build_py
copying torchvision/version.py -> build/lib.linux-x86_64-3.7/torchvision
error: Error: setup script specifies an absolute path:

    /home/stefan/coding/torchvision/test/test_models.cpp

setup() arguments must *always* be /-separated paths relative to the
setup.py directory, *never* absolute paths.

I'm also not sure if we break things by making torchvision not zip_safe.

The packaging problem is still not solved and I'm more than open for input!

Ref
See the "note" box: https://python-packaging.readthedocs.io/en/latest/non-code-files.html

fmassa · 2019-11-04T10:24:45Z

Hi @sotte ,

Sorry for the delay in replying, I had to fix a few issues with torchvision lately (including making it zip_safe=False :-), see #1536 )

About the error you are facing, from a quick search it might indicate that you might need to do a clean build for torchvision?

Apart from that, I think that the approach you are following is the right one, with the os.path.abspath(os.path.dirname(__file__)) and setting include_package_data.

I don't have much more experience with setuptools than that though

sotte · 2019-11-04T10:26:49Z

@fmassa no worries and thanks for the pointer. I'll give it a try and get back to you.

sotte · 2019-11-04T20:22:12Z

It's getting tricky :) Starting with a fresh build did not help.

Here is a log of what I did:

# Add the following to setup.py
#     zip_safe=False,
#     include_package_data=True,
python setup.py clean
python setup.py install

This yields the error:

...
warning: no previously-included files matching '__pycache__' found under directory '*'
warning: no previously-included files matching '*.py[co]' found under directory '*'
writing manifest file 'torchvision.egg-info/SOURCES.txt'
error: Error: setup script specifies an absolute path:

    /home/stefan/coding/torchvision/test/test_models.cpp

setup() arguments must *always* be /-separated paths relative to the
setup.py directory, *never* absolute paths.

Checking torchvision.egg-info/SOURCES.txt tells me that all *.cpp files are included with their absolute path:

$ cat torchvision.egg-info/SOURCES.txt | grep cpp                                                      
/home/stefan/coding/torchvision/test/test_models.cpp
/home/stefan/coding/torchvision/torchvision/csrc/vision.cpp
/home/stefan/coding/torchvision/torchvision/csrc/cpu/ROIAlign_cpu.cpp
/home/stefan/coding/torchvision/torchvision/csrc/cpu/ROIPool_cpu.cpp
/home/stefan/coding/torchvision/torchvision/csrc/cpu/nms_cpu.cpp
...

get_extensions() in setup.py seems to create absolute paths (line 80). I haven't had time to look into it more thoroughly.

sotte · 2019-11-04T20:26:37Z

Just to verify that I did not break anything in my branch: on master cd17484 when adding include_package_data=True I get the same error when calling python setup.py build.

The SOURCES.txt contained absolute paths which is not valid if you include_package_data. Therefore we switched to relpath.

sotte · 2019-11-10T14:38:46Z

@fmassa So I had to change the setup.py quite a bit (and I don't think this should be necessary) to be able to build torchvision when include_package_data=True. Before my commit 7d6d38c the SOURCES.txt contained absolute paths which should not be the case, but seemingly only produces errors when include_package_data=True. I'm quite confident that I broke the build with that commit though :)

Please, anybody who is more knowledgeable with python packaging take a look at this!

That being said, the PR should be complete now. Feel free to build the docs yourself and let me know if it's ok.

sotte · 2020-01-09T08:41:37Z

@fmassa Please verify that it works on you end. If so I think we can merge this one (after rebasing).

fmassa · 2020-01-09T20:23:21Z

Hi @sotte

Very sorry for the delay in coming back to you.

I'll get this reviewed and merged tomorrow, thanks a lot!

sotte · 2020-01-10T11:10:54Z

Absolutely no problem. Just let me know if you need anything else.

sotte · 2020-03-04T16:38:36Z

@fmassa friendly ping :)

Let me know what I can do to get this into masteri

fmassa · 2020-03-30T15:31:56Z

Very sorry for the delay @sotte! Was very busy working on a paper submission.

I'm putting some time aside to review this tomorrow, thanks for the patience!

vincentqb

Overall, this looks good to me. The functions should definitely be private for now, since we can always decide to make them public at a later time.

The tests are failing to build though.

Building wheels for collected packages: torchvision
  Building wheel for torchvision (setup.py) ... error
  ERROR: Command errored out with exit status 1:

docs/source/transforms.rst

torchvision/transforms/transforms.py

…to_transform_docs

sotte · 2020-06-08T18:11:42Z

@fmassa I was not able to build when using abspath and (I think) I was getting error messages similar to the current ones of circleCI: https://app.circleci.com/pipelines/github/pytorch/vision/2750/workflows/978a424b-8d15-4271-9ff5-df6cc9fa1360/jobs/151904/steps

setup() arguments must *always* be /-separated paths relative to the
setup.py directory, *never* absolute paths.

NicolasHug · 2021-04-19T14:51:36Z

Closing since #3652 has been merged.

Thanks a lot for the proposal and for initiating the work on this PR @sotte !

I opened #3688 as a follow-up

sotte added 2 commits October 7, 2019 19:43

Configure sphinx to display matplotlib plots

7f43913

Add helper functions: _plot_images and _sample_image

a92c165

sotte mentioned this pull request Oct 7, 2019

Add examples of transformation to docs #1409

Closed

sotte added 3 commits October 14, 2019 09:01

Add examples to all transform classes

14e6927

Split transforms.rst

048d974

Fix inconsistencies and typos in transforms.py

fa9e946

- All "Transforms on torch.*Tensor" document their __call__ in a consistent manner and __call__ is shown in sphinx. - Fixed typos, punctuation, and naming. - Fixed wrong format of doc strings.

sotte force-pushed the add_examples_to_transform_docs branch from 824a8fd to fa9e946 Compare October 14, 2019 08:21

fmassa reviewed Oct 14, 2019

View reviewed changes

Document __call__ of TensorTransforms consistently

b9a7678

Report their input/output shapes.

sotte added 2 commits October 27, 2019 08:04

Fix seeds in transform examples

5108c1e

Load image from torchvision.assets (not working yet)

ad79607

Use relpaths in setup.py

7d6d38c

The SOURCES.txt contained absolute paths which is not valid if you include_package_data. Therefore we switched to relpath.

sotte changed the title ~~[WIP] Add examples to transform docs~~ Add examples to transform docs Jan 9, 2020

Merge branch 'master' into add_examples_to_transform_docs

f136767

vincentqb suggested changes Apr 23, 2020

View reviewed changes

docs/source/transforms.rst Show resolved Hide resolved

torchvision/transforms/transforms.py Outdated Show resolved Hide resolved

torchvision/transforms/transforms.py Outdated Show resolved Hide resolved

torchvision/transforms/transforms.py Outdated Show resolved Hide resolved

vincentqb and others added 6 commits April 23, 2020 11:42

lint

ce2174c

Reverst docstring changes as vincentqb suggested

17db875

Merge branch 'master' of github.com:pytorch/vision into add_examples_…

84b0ac1

…to_transform_docs

Update mypi to not consider numpy and matplotlib

3a298b4

Try reverting changes to get_extension

f5148a4

Try revert change to cwd

868f388

facebook-github-bot added the cla signed label Oct 30, 2020

NicolasHug mentioned this pull request Apr 16, 2021

Add illustrations of transforms with sphinx-gallery #3652

Merged

NicolasHug closed this Apr 19, 2021

Add examples to transform docs #1426

Add examples to transform docs #1426

Uh oh!

Conversation

sotte commented Oct 7, 2019 • edited by vincentqb Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Make _sample_image() public

Implementation of _sample_image() public

TODOs

Uh oh!

codecov-io commented Oct 14, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

fmassa left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

fmassa commented Oct 18, 2019

Uh oh!

sotte commented Oct 19, 2019

Uh oh!

sotte commented Oct 19, 2019

Uh oh!

fmassa commented Oct 25, 2019

Uh oh!

jlin27 commented Oct 25, 2019

Uh oh!

sotte commented Oct 27, 2019

Uh oh!

fmassa commented Nov 4, 2019

Uh oh!

sotte commented Nov 4, 2019

Uh oh!

sotte commented Nov 4, 2019

Uh oh!

sotte commented Nov 4, 2019

Uh oh!

sotte commented Nov 10, 2019

Uh oh!

sotte commented Jan 9, 2020

Uh oh!

fmassa commented Jan 9, 2020

Uh oh!

sotte commented Jan 10, 2020

Uh oh!

sotte commented Mar 4, 2020

Uh oh!

fmassa commented Mar 30, 2020

Uh oh!

vincentqb left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sotte commented Jun 8, 2020

Uh oh!

NicolasHug commented Apr 19, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

sotte commented Oct 7, 2019 •

edited by vincentqb

Loading

Make `_sample_image()` public

Implementation of `_sample_image()` public

codecov-io commented Oct 14, 2019 •

edited

Loading