## Description
By design, our transforms v2 can handle arbitrary input structures. Internally, we are using `torch.utils._pytree` for this.
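For reference, a minimal sketch of what `tree_flatten` / `tree_unflatten` do (the sample structure below is made up for illustration):

```python
from torch.utils._pytree import tree_flatten, tree_unflatten

# A hypothetical (image, target) sample with nested structure.
sample = {"image": "image_tensor", "target": {"boxes": "bbox", "labels": "labels"}}

# tree_flatten extracts the leaves into a flat list and records the
# original structure in a spec object.
flat_sample, spec = tree_flatten(sample)
print(flat_sample)  # ['image_tensor', 'bbox', 'labels']

# tree_unflatten rebuilds the original structure from the (possibly
# transformed) leaves.
assert tree_unflatten(flat_sample, spec) == sample
```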
## Status quo
The simplest transforms flatten / unflatten only once:
vision/torchvision/prototype/transforms/_transform.py, lines 37 to 42 in 54a2d4e:

```python
flat_inputs, spec = tree_flatten(sample)
flat_outputs = [
    self._transform(inpt, params) if _isinstance(inpt, self._transformed_types) else inpt
    for inpt in flat_inputs
]
return tree_unflatten(flat_outputs, spec)
```
However, albeit hidden from the user, most transforms flatten at least twice, if not more often:
- If a transform needs to know the spatial size to compute its params, e.g.

  ```python
  height, width = query_spatial_size(sample)
  ```

  the extraction logic flattens again:

  vision/torchvision/prototype/transforms/_utils.py, lines 101 to 102 in 54a2d4e:

  ```python
  def query_spatial_size(sample: Any) -> Tuple[int, int]:
      flat_sample, _ = tree_flatten(sample)
  ```
- If a transform performs some checks on the sample before transforming, e.g.

  vision/torchvision/prototype/transforms/_geometry.py, lines 188 to 190 in 54a2d4e:

  ```python
  if has_any(inputs, features.BoundingBox, features.Mask):
      raise TypeError(f"BoundingBox'es and Mask's are not supported by {type(self).__name__}()")
  return super().forward(*inputs)
  ```

  the checking utility flattens again:

  vision/torchvision/prototype/transforms/_utils.py, lines 124 to 125 in 54a2d4e:

  ```python
  def has_any(sample: Any, *types_or_checks: Union[Type, Callable[[Any], bool]]) -> bool:
      flat_sample, _ = tree_flatten(sample)
  ```
This repeated flattening of course has performance costs that can be avoided.
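To get a rough sense of the overhead, one can time `tree_flatten` in isolation. A minimal, hypothetical measurement sketch (the sample shape and iteration count are arbitrary):

```python
import timeit

from torch.utils._pytree import tree_flatten

# Hypothetical detection-style sample; object() stands in for tensors.
sample = {"image": object(), "target": {"boxes": object(), "labels": object()}}

# One flatten vs. three flattens per transform call, 100k calls each.
t1 = timeit.timeit(lambda: tree_flatten(sample), number=100_000)
t3 = timeit.timeit(lambda: [tree_flatten(sample) for _ in range(3)], number=100_000)
print(f"1x tree_flatten: {t1:.2f}s, 3x tree_flatten: {t3:.2f}s")
```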
## Proposal
All of the cases where we perform the extra flattening happen in internal and thus non-user-facing methods. Thus, instead of keeping the option to operate on arbitrary input structures in our utilities, we could have them work on already flattened inputs. This would avoid the repeated `tree_flatten` calls inside of them.
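A minimal sketch of what flattened-input variants of the utilities above could look like. The `flat_sample` parameter and the simplified lookup logic are assumptions of this proposal, not the current API:

```python
from typing import Any, Callable, List, Tuple, Type, Union


def query_spatial_size(flat_sample: List[Any]) -> Tuple[int, int]:
    # The caller flattens once; this utility only scans the leaves. The
    # actual spatial size lookup is heavily simplified here.
    for item in flat_sample:
        if hasattr(item, "spatial_size"):
            return item.spatial_size
    raise TypeError("No image or video was found in the sample")


def has_any(flat_sample: List[Any], *types_or_checks: Union[Type, Callable[[Any], bool]]) -> bool:
    # Same idea: no tree_flatten call inside the utility anymore.
    for item in flat_sample:
        for type_or_check in types_or_checks:
            matched = (
                isinstance(item, type_or_check)
                if isinstance(type_or_check, type)
                else type_or_check(item)
            )
            if matched:
                return True
    return False
```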
For our transforms that means two changes:
- The extra calls in `_get_params` can be avoided by simply flattening before its call (see the first sketch after this list):

  vision/torchvision/prototype/transforms/_transform.py, lines 35 to 37 in 54a2d4e:

  ```python
  params = self._get_params(sample)

  flat_inputs, spec = tree_flatten(sample)
  ```
- The extra calls in overridden `forward`'s require more boilerplate code. Basically, each transform that overrides `forward` needs to perform the flattening / unflattening itself, since the check utilities are called before the `super().forward(...)` call. This also means that we are technically still flattening twice, although the second flatten inside `super().forward(...)` does no useful work.

  However, there is #6503 (introduce `_check` method for type checks on prototype transforms), which introduces a common interface for the checks. IIRC, we never followed up on it, since we eliminated some boilerplate in the overridden `forward`'s in #6504 ([proto] Simplified code in overridden transform forward methods). Since this proposal would re-add some boilerplate for performance gains, we could pick #6503 up again. If we do, this leaves very few, objectively "outlier" transforms that would need to have this boilerplate.

  If we go for the common check interface, it could receive the already flattened sample as well, similar to what was proposed in 1. (see the second sketch after this list).
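A minimal sketch of the reordering from the first change, assuming `_get_params` is changed to take the already flattened leaves (a hypothetical signature change; `_isinstance` is the existing helper from the snippet above):

```python
def forward(self, *inputs: Any) -> Any:
    sample = inputs if len(inputs) > 1 else inputs[0]

    # Flatten exactly once, up front, and reuse the leaves everywhere.
    flat_inputs, spec = tree_flatten(sample)
    params = self._get_params(flat_inputs)  # hypothetical: receives leaves, not the sample

    flat_outputs = [
        self._transform(inpt, params) if _isinstance(inpt, self._transformed_types) else inpt
        for inpt in flat_inputs
    ]
    return tree_unflatten(flat_outputs, spec)
```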
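And a sketch of how the common check interface from #6503 could receive the flattened sample as well. The `_check` hook name comes from that PR; everything else here is an assumption of this proposal:

```python
from typing import Any, List

import torch.nn as nn
from torch.utils._pytree import tree_flatten


class Transform(nn.Module):
    def _check(self, flat_inputs: List[Any]) -> None:
        # Default: no checks. Called by forward() after the single flatten.
        pass

    def forward(self, *inputs: Any) -> Any:
        sample = inputs if len(inputs) > 1 else inputs[0]
        flat_inputs, spec = tree_flatten(sample)
        self._check(flat_inputs)
        ...  # transform and unflatten as in the sketch above


class MyGeometryTransform(Transform):  # hypothetical example transform
    def _check(self, flat_inputs: List[Any]) -> None:
        # Reuses the leaves from forward(); has_any no longer flattens.
        if has_any(flat_inputs, features.BoundingBox, features.Mask):
            raise TypeError(f"BoundingBox'es and Mask's are not supported by {type(self).__name__}()")
```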
## Conclusion
Flattening an input sample multiple times inside a single transformation has no benefits while slowing down execution. This issue proposes a way to avoid this while keeping the user-facing API as convenient as it is.