
Conversation


Tprojects66554 commented Nov 6, 2025

Description

This pull request addresses two critical issues in the Non-Sensitivity metric related to the features_in_step parameter and the internal logic of pixel perturbations.

  1. Logical inconsistency in perturbation evaluation
    When features_in_step > 1, multiple pixels (both feature and non-feature) are perturbed simultaneously within the same step.
    As a result, the computed difference between the perturbed and original predictions (y_pred) reflects a mixed group of pixels, so it is impossible to tell whether the model is insensitive to any individual pixel.

  2. Shape mismatch causing ValueError
    When running the metric with features_in_step != 1, Quantus raised:

    ValueError: operands could not be broadcast together with shapes (17,24) (17,150528)
    

    This occurred at:

    return (preds_differences ^ non_features).sum(-1)

    where:

    • non_features → shape (batch_size, n_features)
    • preds_differences → shape (batch_size, n_perturbations)
      Since n_perturbations != n_features for multi-step perturbations, the XOR (^) operation failed due to incompatible dimensions (a minimal reproduction is sketched below).

These issues caused both logical misinterpretation of sensitivity violations and runtime failures during evaluation.
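
For reference, the broadcasting failure can be reproduced with plain NumPy, independent of any model. The array sizes below are illustrative only (150528 would correspond to a 224x224x3 input, and 24 to the number of perturbation steps once features_in_step groups many features per step); the point is simply that the boolean XOR cannot broadcast when the trailing dimensions differ:

    import numpy as np

    # Illustrative sizes: a batch of 17 inputs, 150528 features, 24 perturbation steps.
    batch_size, n_features, n_perturbations = 17, 150528, 24

    non_features = np.zeros((batch_size, n_features), dtype=bool)            # per-feature mask
    preds_differences = np.zeros((batch_size, n_perturbations), dtype=bool)  # per-step result

    try:
        (preds_differences ^ non_features).sum(-1)
    except ValueError as err:
        print(err)  # operands could not be broadcast together with shapes (17,24) (17,150528)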
Link to the issue

  • https://github.com/understandable-machine-intelligence-lab/Quantus/issues/367

Implemented changes

  • Rewrote the perturbation loop to evaluate per-pixel sensitivity, ensuring a clean separation between feature and non-feature perturbations (a simplified sketch of the per-pixel evaluation follows this list).
  • Adjusted the accumulation logic to correctly track prediction stability across perturbation steps.
  • Fixed the broadcasting mismatch by aligning array dimensions and reshaping operations before computing sensitivity violations.
  • Added debug information and improved documentation for reproducibility and clarity.
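
To make the intended behaviour concrete, the sketch below outlines a per-feature evaluation of non-sensitivity for a single input. It is a simplified illustration under stated assumptions, not the code added in this PR: model, x, a, perturb, and eps are hypothetical stand-ins for the corresponding Quantus internals.

    import numpy as np

    def non_sensitivity_per_feature(model, x, a, perturb, eps=1e-5):
        """Count non-sensitivity violations for one flattened input.

        Hypothetical helper, not the Quantus implementation:
          model   -- callable returning a scalar prediction for a flattened input
          x       -- flattened input of shape (n_features,)
          a       -- attribution map of the same shape
          perturb -- callable returning a copy of x with the given indices perturbed
          eps     -- tolerance for "zero" attribution / "unchanged" prediction
        """
        y_orig = model(x)
        non_features = np.abs(a) < eps                 # explanation marks feature as irrelevant
        pred_unchanged = np.zeros_like(non_features)   # model is insensitive to feature i

        # One feature per step, so each prediction difference is attributable to a single pixel.
        for i in range(x.shape[0]):
            pred_unchanged[i] = np.abs(model(perturb(x, indices=[i])) - y_orig) < eps

        # A violation is a feature where irrelevance (zero attribution) and insensitivity
        # (unchanged prediction) disagree, i.e. the symmetric difference of the two sets.
        return int(np.sum(non_features ^ pred_unchanged))

With features_in_step > 1, the inner loop would instead perturb groups of indices, and the resulting per-step differences could no longer be compared element-wise against the per-feature mask, which is exactly the mismatch described above.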

Minimum acceptance criteria

  • All tests under tests/metrics/test_non_sensitivity_metric.py and related evaluation modules pass successfully across supported environments (py310–py311).
  • The metric produces consistent scores for all features_in_step configurations without shape or logic errors (see the usage sketch after this list).
  • Reviewer confirmation by @annahedstroem or a Quantus core maintainer.
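
As a quick sanity check for the second criterion, something along the following lines should run without shape errors once the fix is in. This assumes the metric is exposed as quantus.NonSensitivity and follows the standard Quantus call interface; the model, inputs, and attributions are random placeholders, with image sizes chosen so that every tested features_in_step value divides the feature count evenly:

    import numpy as np
    import torch
    import quantus

    # Random placeholders: 8 RGB images of size 16x16 (3 * 16 * 16 = 768 features).
    x_batch = np.random.rand(8, 3, 16, 16).astype(np.float32)
    y_batch = np.random.randint(0, 10, size=8)
    a_batch = np.random.rand(8, 3, 16, 16).astype(np.float32)

    # A throwaway model; any torch.nn.Module with matching input/output shapes will do.
    model = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(3 * 16 * 16, 10))

    for features_in_step in (1, 4, 16):
        metric = quantus.NonSensitivity(features_in_step=features_in_step)
        scores = metric(model=model, x_batch=x_batch, y_batch=y_batch, a_batch=a_batch)
        print(features_in_step, scores)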

Tprojects66554 marked this pull request as ready for review November 9, 2025 17:34
Tprojects66554 (Author)

Hi @annahedstroem, I would appreciate it if you could take a look at this PR and let me know what you think.
