Description
System information
- TensorFlow version (you are using): tf-nightly-2.0-preview
- TensorFlow Addons version: source
- Is it in the tf.contrib (if so, where): no
- Are you willing to contribute it (yes/no): yes
- Are you willing to maintain it going forward? (yes/no): yes
Describe the feature and the current behavior/state.
Currently, tfa.image.mean_filter2d is implemented with tf.image.extract_patches, although it is functionally equivalent to applying a 2-D convolution with a box filter (or uniform filter) channel by channel. This can be done easily with tf.nn.depthwise_conv2d, with a significant speed improvement.
Here is a notebook example comparing the performance of the two implementations. On the Colab platform, the depthwise-convolution version is about 4x faster than the original for both single and batched images. Note that I extended the original implementation with tf.map_fn so it supports 4-D input, and some trivial normalization is omitted.
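A minimal sketch of the proposed approach (the function name `mean_filter2d_depthwise` is mine, not part of any existing API): a box filter with weights 1/(fh*fw) is applied per channel via tf.nn.depthwise_conv2d on a 4-D float image.

```python
import tensorflow as tf

def mean_filter2d_depthwise(image, filter_shape=(3, 3)):
    """Mean filter via depthwise convolution (sketch).

    image: 4-D float tensor of shape [batch, height, width, channels].
    """
    fh, fw = filter_shape
    channels = image.shape[-1]
    # Box filter: every weight is 1/(fh*fw); one identical filter per channel.
    kernel = tf.ones((fh, fw, channels, 1), dtype=image.dtype) / (fh * fw)
    return tf.nn.depthwise_conv2d(
        image, kernel, strides=[1, 1, 1, 1], padding="SAME")
```

Because depthwise_conv2d operates on the full 4-D batch at once, no tf.map_fn loop over images is needed.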
I'm still wondering why normalization and casting are performed in the original implementations of both mean_filter2d and median_filter2d (no offense, just want to know why). For comparison, other libraries preserve the input type:
- For OpenCV:
dst: destination array of the same size and type as src.
- For SciPy:
output : array or dtype, optional
The array in which to place the output, or the dtype of the returned array. By default an array of the same dtype as input will be created.
- For MATLAB:
Output image, returned as a numeric matrix of the same class as the input image I.
As far as I'm concerned, there is no need to restrict the computation to the range [0, 1], nor to transform the output back to the uint8 range when the input is not of type uint8. All we need to do is compute the output (possibly casting the image to float first, because depthwise_conv2d does not accept non-float kernels or inputs) and cast it back to the original data type. For mean and median operations the result stays within the input range, so no over/underflow can occur. Hence casting the output back to the original data type should be safe, though I think it's better to discuss this first.
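The cast-and-cast-back idea can be sketched as a small wrapper (the helper name `filter_preserving_dtype` is hypothetical, used only for illustration): cast to float for the convolution, then cast back to image.dtype, with no [0, 1] normalization in between.

```python
import tensorflow as tf

def filter_preserving_dtype(image, filter_fn):
    """Sketch: apply a float-only filter while preserving the input dtype.

    filter_fn: any filter that requires float input
    (e.g. a depthwise-convolution mean filter).
    """
    orig_dtype = image.dtype
    # depthwise_conv2d does not accept non-float input, so cast up first.
    x = tf.cast(image, tf.float32)
    y = filter_fn(x)
    # Mean/median stay within the input range, so casting back is safe.
    return tf.cast(y, orig_dtype)
```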
Moreover, if users want to do post-processing on intermediate feature maps from CNNs, those feature maps are unlikely to lie in [0, 1] or the uint8 range; they are more likely to be real-valued tensors whose range depends on the activation function used.
Will this change the current api? How?
mean_filter2d(image, filter_shape=(3, 3), padding="REFLECT", constant_values=0, name=None)
The new API is similar to scipy.ndimage.uniform_filter, which supports padding modes. The default padding mode is "REFLECT" because both SciPy and OpenCV use reflect padding by default (while some MATLAB toolboxes adopt zero-padding by default). Besides, tf.image.sobel_edges, another filter-based op, also uses REFLECT padding.
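The padding modes in the proposed signature map directly onto tf.pad, which already supports "REFLECT", "SYMMETRIC", and "CONSTANT". A sketch (the helper name `pad_for_filter` is mine): pad the image so a subsequent VALID convolution produces SciPy-style border behavior.

```python
import tensorflow as tf

def pad_for_filter(image, filter_shape=(3, 3), mode="REFLECT",
                   constant_values=0):
    """Sketch: pad a 4-D image for a filter of the given shape.

    mode: one of "REFLECT", "SYMMETRIC", "CONSTANT" (tf.pad's modes).
    A VALID convolution on the result matches the padded-border semantics.
    """
    fh, fw = filter_shape
    ph, pw = fh // 2, fw // 2
    # Pad only the spatial dimensions, not batch or channels.
    paddings = [[0, 0], [ph, ph], [pw, pw], [0, 0]]
    return tf.pad(image, paddings, mode=mode,
                  constant_values=constant_values)
```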
Who will benefit with this feature?
People who want to process images.
Any Other info.
To conclude:
- Implement with tf.nn.depthwise_conv2d, which speeds things up considerably and supports batch-wise computation.
- Cast the input image to float if needed and cast the output back to image.dtype.
- Support padding modes.
Some references for API design: