Preserve Datapoint subclasses instead of returning tensors #7807

NicolasHug · 2023-08-07T16:35:08Z

This PR addresses the "subclass unwrapping" issue from #7319.

We now always preserve the Datapoint type when doing native operations like img + 3 or img + some_tensor. This greatly simplifies the Datapoint class implementation and avoids the potentially surprising "unwrapping" behaviour.
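A minimal sketch of the type-preserving behaviour, using a bare torch.Tensor subclass as a stand-in. ToyImage is hypothetical and is not a torchvision class; the sketch relies only on the default torch.Tensor.__torch_function__, which re-wraps results in the subclass.

```python
import torch


class ToyImage(torch.Tensor):
    """Hypothetical stand-in for a Datapoint; adds no behaviour of its own."""


img = torch.rand(3, 4, 4).as_subclass(ToyImage)

# Native operations keep the subclass instead of returning a plain tensor.
shifted = img + 3
mixed = img + torch.ones(3, 4, 4)

# An explicit opt-out is still available when a plain tensor is wanted.
plain = shifted.as_subclass(torch.Tensor)

print(type(shifted).__name__, type(mixed).__name__, type(plain).__name__)
```

The real Datapoint classes may differ in detail; this only illustrates what "preserving the type" means for img + 3 and img + some_tensor.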

BoundingBoxes is the only class that needs special treatment because it carries metadata, so it's the only class for which we override __torch_function__. Overall, the Datapoint logic is much simpler, as it largely relies on the default behaviour of torch.Tensor.
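To illustrate why a metadata-carrying class needs its own __torch_function__, here is a rough sketch. ToyBoxes, its wrap() helper and the format attribute are all hypothetical and this is not the PR's BoundingBoxes implementation; it only assumes that the default torch.Tensor.__torch_function__ re-wraps outputs without knowing about extra attributes.

```python
import torch


class ToyBoxes(torch.Tensor):
    """Hypothetical metadata-carrying subclass (not torchvision's BoundingBoxes)."""

    @classmethod
    def wrap(cls, data, *, format):
        boxes = data.as_subclass(cls)
        boxes.format = format  # metadata lives on the instance
        return boxes

    @classmethod
    def __torch_function__(cls, func, types, args=(), kwargs=None):
        # The default implementation already re-wraps the output in `cls` ...
        out = super().__torch_function__(func, types, args, kwargs or {})
        # ... but it knows nothing about `format`, so copy it from the first
        # ToyBoxes found among the positional arguments (a simplification:
        # nested containers and keyword arguments are not searched).
        if isinstance(out, cls) and getattr(out, "format", None) is None:
            src = next(
                (a for a in args if isinstance(a, cls) and getattr(a, "format", None) is not None),
                None,
            )
            if src is not None:
                out.format = src.format
        return out


boxes = ToyBoxes.wrap(torch.tensor([[0.0, 0.0, 10.0, 10.0]]), format="XYXY")
shifted = boxes + 5
print(type(shifted).__name__, shifted.format)
```

A class without metadata needs none of this, which is why only one override is required.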

Take a look at the newly-added test_operations() for an illustration of what is now possible.

Note: following #7807 (comment), the unwrapping / rewrapping mechanism in our functionals is preserved for perf reasons only.
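The unwrap / rewrap pattern mentioned in the note can be sketched roughly as follows. ToyImage, flip_and_scale and its kernel are hypothetical names, not torchvision functions; the assumption is that running several tensor ops on a plain tensor avoids one __torch_function__ round trip per op.

```python
import torch


class ToyImage(torch.Tensor):
    """Hypothetical stand-in for a Datapoint."""


def _flip_and_scale_kernel(image: torch.Tensor) -> torch.Tensor:
    # Several tensor ops in a row; on a subclass each one would go
    # through __torch_function__.
    return image.flip(-1).mul(2).clamp(0, 1)


def flip_and_scale(inpt: torch.Tensor) -> torch.Tensor:
    if type(inpt) is torch.Tensor:
        return _flip_and_scale_kernel(inpt)
    # Unwrap once, run the kernel on a plain tensor, re-wrap once.
    output = _flip_and_scale_kernel(inpt.as_subclass(torch.Tensor))
    return output.as_subclass(type(inpt))


img = torch.tensor([[0.1, 0.2, 0.9]]).as_subclass(ToyImage)
out = flip_and_scale(img)
plain_out = flip_and_scale(torch.rand(1, 3))
```

The caller still gets a ToyImage back, so the unwrapping is invisible from the outside.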

pytorch-bot · 2023-08-07T16:35:11Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/vision/7807

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There is 1 currently active SEV. If your PR is affected, please view it below:

ROCm CI upgrade in progress

❌ 1 New Failure, 1 Unrelated Failure

As of commit ec17580:

NEW FAILURE - The following job has failed:

build / linux-job (gh)

BROKEN TRUNK - The following job failed, but the failure was already present on the merge base bf6a8dc:

👉 Rebase onto the `viable/strict` branch to avoid these failures

unittests-macos (3.8, macos-m1-12) / macos-job (gh)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

test/test_transforms_v2_refactored.py

NicolasHug · 2023-08-07T16:44:09Z

+    with pytest.raises(TypeError, match="unsupported operand type"):
+        img + mask


Users want to do that? Perfect, they'll need to explicitly say what type they want as output by converting one of those operands to a tensor. We don't have to assume anything on their behalf and (surprisingly) return a pure tensor.

EDIT: as @pmeier pointed out offline, this is in fact the same behaviour as on main - nothing new
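A small sketch of the situation discussed in this thread, with two unrelated torch.Tensor subclasses as stand-ins (ToyImage and ToyMask are hypothetical, not the torchvision classes). It assumes only the default torch.Tensor.__torch_function__, which declines to handle a mix of unrelated subclasses.

```python
import torch


class ToyImage(torch.Tensor):
    """Hypothetical stand-in for an image Datapoint."""


class ToyMask(torch.Tensor):
    """Hypothetical stand-in for a mask Datapoint."""


img = torch.rand(1, 4, 4).as_subclass(ToyImage)
mask = torch.ones(1, 4, 4).as_subclass(ToyMask)

# Two unrelated subclasses: neither side knows which type to return.
try:
    img + mask
    raised = False
except TypeError:
    raised = True

# Dropping one operand to a plain tensor makes the intent explicit:
# the result takes the type of the remaining subclass.
as_mask = img.as_subclass(torch.Tensor) + mask
as_image = img + mask.as_subclass(torch.Tensor)

print(raised, type(as_mask).__name__, type(as_image).__name__)
```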

NicolasHug · 2023-08-07T16:45:16Z

test/test_transforms_v2_functional.py


-    output_boxes, output_canvas_size = F.resized_crop_bounding_boxes(in_boxes, format, top, left, height, width, size)
+    output_boxes, output_canvas_size = F.resized_crop_bounding_boxes(
+        in_boxes.as_subclass(torch.Tensor), format, top, left, height, width, size


This (and similar changes below) was needed because in_boxes is now still a BBox instance, whereas resized_crop_bounding_boxes expects a plain tensor (the error says something like "if you pass a bbox, don't pass the format").
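As a rough illustration of why the explicit conversion is needed, consider a kernel that takes the format as an argument and therefore rejects inputs that already carry one. ToyBoxes and crop_boxes_kernel are hypothetical; the check and its message are assumptions modelled on the error paraphrased above, not torchvision's actual code.

```python
import torch


class ToyBoxes(torch.Tensor):
    """Hypothetical stand-in for BoundingBoxes."""


def crop_boxes_kernel(boxes: torch.Tensor, format: str, top: int, left: int) -> torch.Tensor:
    # Hypothetical kernel: the format is passed explicitly, so the input
    # must be a plain tensor rather than a subclass carrying its own format.
    if type(boxes) is not torch.Tensor:
        raise ValueError("pass a plain tensor when `format` is given explicitly")
    assert format == "XYXY", "this sketch only handles XYXY boxes"
    offset = torch.tensor([left, top, left, top], dtype=boxes.dtype)
    return boxes - offset


in_boxes = torch.tensor([[10.0, 20.0, 30.0, 40.0]]).as_subclass(ToyBoxes)
in_boxes = in_boxes * 1  # still a ToyBoxes after a native operation

try:
    crop_boxes_kernel(in_boxes, "XYXY", top=5, left=2)
    rejected = False
except ValueError:
    rejected = True

out = crop_boxes_kernel(in_boxes.as_subclass(torch.Tensor), "XYXY", top=5, left=2)
```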

NicolasHug · 2023-08-09T09:09:53Z

torchvision/prototype/transforms/_augment.py

        # Copy-paste masks:
        masks = masks * inverse_paste_alpha_mask
-        non_all_zero_masks = masks.sum((-1, -2)) > 0
+        non_all_zero_masks = (masks.sum((-1, -2)) > 0).as_subclass(torch.Tensor)


There were 2 other similar failures (below). The reason for the error is that (masks.sum((-1, -2)) > 0) is still a Mask object, and we can't use Masks as indices (line below).

This is the only kind of instance that I identified as potentially weird / confusing. But the error message is good enough to figure out the fix.
(In contrast, unwrapping all the time is likely to cause a lot more surprises and would force users to re-wrap all the time.)
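The fix from the diff above can be reproduced on a toy example. ToyMask is a hypothetical stand-in for Mask, and labels is an invented tensor used only to have something to index; the sketch shows that the comparison result keeps the subclass and how it is turned into a plain boolean index.

```python
import torch


class ToyMask(torch.Tensor):
    """Hypothetical stand-in for the Mask Datapoint."""


masks = torch.tensor(
    [
        [[0, 0], [0, 0]],  # empty mask
        [[0, 1], [1, 1]],  # non-empty mask
    ]
).as_subclass(ToyMask)
labels = torch.tensor([7, 9])

# The reduction and the comparison both keep the subclass ...
non_all_zero = masks.sum((-1, -2)) > 0
# ... so convert to a plain boolean tensor before using it as an index.
keep = non_all_zero.as_subclass(torch.Tensor)
kept_labels = labels[keep]

print(type(non_all_zero).__name__, kept_labels)
```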

NicolasHug · 2023-08-09T16:05:05Z

test/test_transforms_v2_refactored.py

        assert _get_kernel(F.resize, MyDatapoint) is resize_my_datapoint
+
+
+def test_operations():


This test is mostly for illustrating the new behaviour. If we're OK with it, I'll refactor this test into something a little more polished.

NicolasHug · 2023-08-17T10:34:57Z

Got superseded by #7825

NicolasHug added 3 commits August 7, 2023 15:55

move stuff out of CM

8f8f936

Call wrap_like for all exceptions

b1018a9

Get rid of __torchfunction__ and the whole wrapping/unwrapping logic

ddd88cd

facebook-github-bot added the cla signed label Aug 7, 2023

bbox tests

4e8b53d

NicolasHug commented Aug 7, 2023

torchvision/datapoints/_datapoint.py (outdated, resolved)

NicolasHug commented Aug 7, 2023

torchvision/transforms/v2/functional/_utils.py (outdated, resolved)

NicolasHug changed the title ~~Get rid of __torchfunction__ and the whole wrapping/unwrapping logic~~ Get rid of __torchfunction__ Aug 8, 2023

NicolasHug added 3 commits August 8, 2023 11:05

Put back wrapping / unwrapping in kernels

7471271

Merge branch 'main' of github.com:pytorch/vision into lajenfljanfeljnfe

c5b44a9

Fix tests

23b9704

NicolasHug commented Aug 9, 2023

preserve metadata on bboxes

f12fee1

NicolasHug commented Aug 9, 2023

NicolasHug added 2 commits August 9, 2023 17:10

Merge branch 'main' of github.com:pytorch/vision into lajenfljanfeljnfe

1962124

Merge branch 'main' of github.com:pytorch/vision into lajenfljanfeljnfe

e9c1173

NicolasHug changed the title ~~Get rid of __torchfunction__~~ Preserve Datapoint subclasses instead of returning tensors Aug 9, 2023

NicolasHug marked this pull request as ready for review August 9, 2023 16:36

NicolasHug added 2 commits August 10, 2023 10:25

mypy

854b01c

Merge branch 'main' of github.com:pytorch/vision into lajenfljanfeljnfe

ec17580

NicolasHug mentioned this pull request Aug 12, 2023

Allow users to choose whether to return Datapoint subclasses or pure Tensor #7825

Merged

NicolasHug closed this Aug 17, 2023
