NXP backend: Replace pass to fuse activation functions with joint quantization with activation #14816
base: main
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14816
Note: Links to docs will display an error until the docs builds have been completed.
❌ 4 New Failures as of commit 8d9107b with merge base 2c603e4. The following jobs have failed:
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@pytorchbot label "module: nxp" "release notes: nxp"
exir_ops.edge.aten.hardtanh.default,
exir_ops.edge.aten.relu.default,
exir_ops.edge.aten.sigmoid.default,
exir_ops.edge.aten.tanh.default,
Is this necessary? I would expect that since the Move*AuxiliaryOperatorIntoSeparateQDQClusterPass now supports these operators, they should be the main nodes within their own QDQ clusters.
I've looked into it. The reason it is in the QDQClusterRecognizer AUXILIARY_OPS list is not only for use in the partitioner, but also because the QDQClusterRecognizer is used in the Move*AuxiliaryOperatorIntoSeparateQDQClusterPass:
executorch/backends/nxp/edge_passes/move_auxiliary_operator_into_separate_qdq_cluster_pass.py
Line 233 in 33f153a
cluster = QDQClusterRecognizer().get_qdq_cluster(main_cluster_node)
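For context, here is a minimal sketch of the dependency described in this comment. Only QDQClusterRecognizer, AUXILIARY_OPS, and get_qdq_cluster appear in the sources cited above; every other name, and the internal logic, is assumed purely for illustration and is not the actual ExecuTorch code.

```python
# Illustrative sketch (assumed structure): the same recognizer class, and
# therefore the same AUXILIARY_OPS list, is consulted both by the partitioner
# and inside the edge pass that moves auxiliary operators into separate
# QDQ clusters.

AUXILIARY_OPS = [
    "aten.hardtanh.default",
    "aten.relu.default",
    "aten.sigmoid.default",
    "aten.tanh.default",
]


class QDQClusterRecognizer:
    """Toy stand-in; the real class lives in the NXP backend sources."""

    def is_auxiliary(self, op_name: str) -> bool:
        return op_name in AUXILIARY_OPS

    def get_qdq_cluster(self, main_cluster_node: str) -> list[str]:
        # The real implementation collects the quantize/dequantize nodes
        # surrounding the main node; a single-element list stands in here.
        return [main_cluster_node]


def partitioner_check(op_name: str) -> bool:
    # Call site 1 (hypothetical): the partitioner consults the list.
    return QDQClusterRecognizer().is_auxiliary(op_name)


def edge_pass_step(main_cluster_node: str) -> list[str]:
    # Call site 2: mirrors line 233 of
    # move_auxiliary_operator_into_separate_qdq_cluster_pass.py cited above.
    return QDQClusterRecognizer().get_qdq_cluster(main_cluster_node)
```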
6fdef26 to 33f153a (Compare)
+ Move fused activations to separate QDQ cluster
33f153a to 8d9107b (Compare)
Summary
This PR replaces the optimization pass in 'fuse_activation_functions.py' with joint quantization of Conv2D and Linear ops together with fusable activations, i.e. the selected activations supported by Neutron (Relu, Relu6, Sigmoid, Tanh). The logic is driven by the target specs, currently supporting Neutron-C. Tests have been updated. Relu now has non-shared, standalone quantization.
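A rough sketch of the idea described above. The names (NeutronCTargetSpec, annotate_joint_quantization) and the pairing logic are hypothetical and only illustrate how a Conv2D/Linear op could be paired with a following fusable activation based on a target spec; they are not the actual NXP quantizer code.

```python
# Hypothetical sketch of joint quantization of Conv2D/Linear with a fusable
# activation; all names below are illustrative, not the real NXP quantizer API.
from dataclasses import dataclass, field


@dataclass
class NeutronCTargetSpec:
    # Activations Neutron-C can fuse into the preceding Conv2D/Linear,
    # per the summary above.
    fusable_activations: set = field(
        default_factory=lambda: {"relu", "relu6", "sigmoid", "tanh"}
    )


def annotate_joint_quantization(ops: list[str], spec: NeutronCTargetSpec) -> list[tuple]:
    """Pair each conv2d/linear with a directly following fusable activation.

    Returns (producer, activation-or-None) pairs; a pair with an activation
    would be quantized jointly, a lone op would get standalone quantization.
    """
    pairs = []
    i = 0
    while i < len(ops):
        op = ops[i]
        nxt = ops[i + 1] if i + 1 < len(ops) else None
        if op in ("conv2d", "linear") and nxt in spec.fusable_activations:
            pairs.append((op, nxt))   # quantize together with the activation
            i += 2
        else:
            pairs.append((op, None))  # standalone quantization
            i += 1
    return pairs


# Example: conv2d+relu and linear+tanh are annotated jointly;
# the trailing relu stands alone.
print(annotate_joint_quantization(
    ["conv2d", "relu", "linear", "tanh", "relu"], NeutronCTargetSpec()
))
```

The lone relu in the example mirrors the note above that Relu now receives non-shared, standalone quantization when it is not fused into a preceding op.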
Test plan
Unit tests are provided (test_edge_passes.py, test_quantizer.py).
cc @robert-kalmar @JakeStevens @digantdesai