
Conversation


@pmeier pmeier commented Sep 7, 2022

Before this PR, padding degenerate segmentation masks failed:

import torch
from torchvision.prototype.transforms import functional as F
from torchvision.prototype import features

mask = features.SegmentationMask(torch.randint(0, 2, (0, 937, 1024), dtype=torch.bool))
F.pad(mask, 5)
RuntimeError: cannot reshape tensor of 0 elements into shape [-1, 0, 937, 1024] because the unspecified dimension size -1 can be any value and is ambiguous

I fixed the kernel and extended the kernel tests for all input types (images, segmentation masks, bounding boxes, ...) to also check the degenerate case.
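
For context, here is a minimal sketch of one way around the ambiguous reshape (illustrative only; the exact change in the PR may differ): compute the leading size explicitly instead of passing -1, so a 0 in the trailing dimensions no longer makes the reshape ambiguous.

import math

import torch

def flatten_mask_batch(mask):
    # Hypothetical helper, not the PR's code: math.prod(()) == 1, so this also
    # covers masks without extra batch dimensions and never relies on -1.
    num_masks, height, width = mask.shape[-3:]
    batch_size = math.prod(mask.shape[:-3])
    return mask.reshape(batch_size, num_masks, height, width)

mask = torch.randint(0, 2, (0, 937, 1024), dtype=torch.bool)
print(flatten_mask_batch(mask).shape)  # torch.Size([1, 0, 937, 1024]), no RuntimeError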

def pad_segmentation_mask(
    segmentation_mask: torch.Tensor, padding: Union[int, List[int]], padding_mode: str = "constant"
) -> torch.Tensor:
    num_masks, height, width = segmentation_mask.shape[-3:]
Contributor Author

Not sure why we replicated pad_image_tensor here if the only difference is that we set a fixed fill=0.
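
For illustration, the delegation hinted at above could be as small as the following (a sketch that assumes pad_image_tensor accepts these keyword arguments; it is not the code in this PR):

from typing import List, Union

import torch
from torchvision.prototype.transforms.functional import pad_image_tensor

def pad_segmentation_mask(
    segmentation_mask: torch.Tensor, padding: Union[int, List[int]], padding_mode: str = "constant"
) -> torch.Tensor:
    # Sketch: reuse the image kernel and only pin fill to 0, since padded mask
    # pixels should always be background.
    return pad_image_tensor(segmentation_mask, padding=padding, fill=0, padding_mode=padding_mode)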


@pmeier pmeier left a comment


Apart from the ones I fixed, there are more operators that use the .view(-1, ...) idiom.

I've xfailed some tests for them. Plus, there are more bounding box ops that do this. But we don't need to worry about them, because .view(-1, ...) only fails if there is a 0 in the ... part. For bounding boxes we only do

>>> t = torch.empty(0, 4)
>>> t.view(-1, 4)
tensor([], size=(0, 4))

The ambiguity only arises for segmentation masks, which use the channel dimension as the number of objects, and that can be 0.

>>> t = torch.empty(4, 0, 16, 16)
>>> t.view(-1, 0, 16, 16)
RuntimeError: cannot reshape tensor of 0 elements into shape [-1, 0, 16, 16] because the unspecified dimension size -1 can be any value and is ambiguous

That might be a good reason to have something in the kernel other than just calling the image kernel. But we can discuss this later.
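
For illustration, "something in the kernel" could be as small as a guard for the 0-object case before delegating to the image kernel. This is a sketch only; the helper _FT._parse_pad_padding is taken from the diff further down, and the actual kernel may differ.

import torch
from torchvision.transforms import functional_tensor as _FT
from torchvision.prototype.transforms.functional import pad_image_tensor

def pad_segmentation_mask(segmentation_mask, padding, padding_mode="constant"):
    num_masks, height, width = segmentation_mask.shape[-3:]
    if num_masks == 0:
        # Nothing to pad; compute the padded spatial size directly so we never
        # hit the ambiguous view(-1, 0, H, W).
        left, right, top, bottom = _FT._parse_pad_padding(padding)
        return segmentation_mask.new_empty(
            (*segmentation_mask.shape[:-2], height + top + bottom, width + left + right)
        )
    # Common case: reuse the image kernel with a fixed fill of 0.
    return pad_image_tensor(segmentation_mask, padding=padding, fill=0, padding_mode=padding_mode)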

Comment on lines +96 to +97
# transforms.RandomRotation(degrees=(-45, 45)),
# transforms.RandomAffine(degrees=(-45, 45)),
Contributor Author

They don't work since the new input data generation also adds degenerate shapes, which are not supported yet by the underlying kernels.

masks = make_segmentation_mask((32, 24))
ohe_masks = features.SegmentationMask(torch.randint(0, 2, size=(6, 32, 24)))
sample = [image, bboxes, label, ohe_label, masks, ohe_masks]
masks = make_segmentation_mask((32, 24), num_objects=6)
Contributor Author

We currently only work with object detection masks; they were called ohe_masks here before. I've renamed them to masks and removed the tests for what masks referred to before.

    size = size or torch.randint(16, 33, (2,)).tolist()
    shape = (*extra_dims, 1, *size)
    data = make_tensor(shape, low=0, high=num_categories, dtype=dtype)
def make_segmentation_mask(size=None, *, num_objects=None, extra_dims=(), dtype=torch.uint8):
Contributor Author

The removed functionality generated masks for (semantic) segmentation tasks, which we currently don't support.
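
A sketch of what the detection-oriented generator might look like after this change, based only on the new signature shown above (the real test helper may differ):

import torch

def make_segmentation_mask(size=None, *, num_objects=None, extra_dims=(), dtype=torch.uint8):
    # Sketch: one binary channel per object, so num_objects can legitimately
    # be 0 and produce a degenerate mask.
    size = size or torch.randint(16, 33, (2,)).tolist()
    if num_objects is None:
        num_objects = int(torch.randint(1, 11, ()))
    shape = (*extra_dims, num_objects, *size)
    return torch.randint(0, 2, shape, dtype=dtype)

print(make_segmentation_mask((32, 24), num_objects=6).shape)  # torch.Size([6, 32, 24])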

Comment on lines +920 to +923
incorrect_expected_segmentation_mask_setup = pytest.mark.xfail(
    reason="This test fails because the expected result computation is wrong. Fix ASAP.",
    strict=False,
)
Contributor Author

This is to get the CI green for now. As the text implies, we should get to this ASAP, but doing it now would slow down the reference training.


@vfdev-5 vfdev-5 left a comment


Looks OK to me.

padded_image = _FT.pad(
    img=img.view(-1, num_channels, height, width), padding=padding, fill=fill, padding_mode=padding_mode
)
left, right, top, bottom = _FT._parse_pad_padding(padding)
Contributor

Nit: this part of the code could be moved into the else branch to avoid parsing the padding twice in the normal case.


@datumbox datumbox left a comment


LGTM. I'm pretty sure we don't need to clone, but let's check this separately.

@pmeier pmeier merged commit 84dcf69 into pytorch:main Sep 7, 2022
@pmeier pmeier deleted the pad-bbox-degenerate branch September 7, 2022 15:49
facebook-github-bot pushed a commit that referenced this pull request Sep 9, 2022
Summary:
* fix padding for degenerate segmentation masks

* extend test data generation to degenerate inputs

* add even more degenerate shapes

* simplify kernel

* [SKIP CI] only GHA

* add more degenerate segmentation masks

* fix segmentation mask generation

* xfail some tests

* Revert "simplify kernel"

This reverts commit 18c5e4f.

* fix resize for degenerate inputs

* [SKIP CI] CircleCI

* fix RandomIoUCrop test

* [SKIP CI] CircleCI

* cleanup

* [SKIP CI] CircleCI

* add perf TODO comments

* [SKIP CI] CircleCI

Reviewed By: YosuaMichael

Differential Revision: D39381957

fbshipit-source-id: 05a0da7ead77f33bc7ec7423271693a9aef6ad7e