Add Gradient SHAP for layer and neuron #175
Conversation
@@ -454,6 +508,7 @@ def _data_parallel_test_assert(
    )
else:
    attributions_orig = attr_orig.attribute(**kwargs)
    self.setUp()
I added this for determinism, in order to re-initialize the seeds. I'll pass a seed to `attribute` in a separate PR.
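The re-seeding idea can be sketched in plain Python (a hypothetical test class, not the actual captum test code; with torch one would also call `torch.manual_seed`):

```python
import random

class AttributionDeterminismSketch:
    """Sketch of re-seeding between attribution calls (hypothetical names)."""

    def setUp(self):
        # Re-initialize the RNG so sampling-based attribution methods
        # (e.g. GradientSHAP) draw the same noise and baselines each run.
        random.seed(1234)

    def run_twice(self, sample_fn, n=5):
        self.setUp()
        first = [sample_fn() for _ in range(n)]
        self.setUp()  # re-seed again before the second run
        second = [sample_fn() for _ in range(n)]
        return first, second
```

Calling `setUp` before each attribution run makes both runs see identical random draws, which is what the assertion against the data-parallel run relies on.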
Looks good 👍 ! Sorry for the delay in reviewing. Just some nits on documentation and suggestions for tests.
It adds white noise to each input sample `n_samples` times, selects a
random baseline from baselines' distribution and a random point along the
path between the baseline and the input, and computes the gradient of outputs
with respect to those selected random points. The final SHAP values represent
nit: Can update this to explain the logic for the layer variant, e.g. gradient of output with respect to the layer evaluation at the selected random point.
random baseline from baselines' distribution and a random point along the
path between the baseline and the input, and computes the gradient of outputs
with respect to those selected random points. The final SHAP values represent
the expected values of gradients * (inputs - baselines).
nit: Same here, layer evaluation at inputs and baselines.
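For reference, the sampling procedure described in this docstring can be sketched for a 1-D input in plain Python (a hypothetical helper; `grad_f` stands in for autograd):

```python
import random

def gradient_shap_1d(grad_f, x, baselines, n_samples=1000, stdev=0.01):
    """Sketch of GradientSHAP for a scalar input (hypothetical helper).

    Each sample: add white noise to the input, draw a random baseline,
    pick a random point on the baseline -> input path, evaluate the
    gradient there, and accumulate gradient * (input - baseline).
    """
    total = 0.0
    for _ in range(n_samples):
        noisy_x = x + random.gauss(0.0, stdev)   # white noise on the input
        baseline = random.choice(baselines)      # baseline from the distribution
        alpha = random.random()                  # random point along the path
        point = baseline + alpha * (noisy_x - baseline)
        total += grad_f(point) * (noisy_x - baseline)
    return total / n_samples                     # expectation over samples
```

For a linear model the gradient is constant, so the expectation reduces to gradient * (input - baseline), which makes the sketch easy to sanity-check.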
tensor must correspond to the batch size. It will be
repeated for each `n_steps` for each randomly generated
input sample.
Note that the gradients are not computed with respect
nit: attributions are not computed
**attributions** or 2-element tuple of **attributions**, **delta**:
- **attributions** (*tensor* or tuple of *tensors*):
    Attribution score computed based on GradientSHAP with
    respect to each input feature. Attributions will always be
nit: with respect to each neuron in input / output of layer
input_baseline_scaled = tuple(
    self._scale_input(input, baseline, rand_coefficient)
    for input, baseline in zip(inputs, baselines)
Do we want to expose this class for direct usage as well? If so, we need to also format baselines (and probably inputs as well) here, otherwise calling this with the default baselines of None will fail. It seems like the same is true for InputBaselineXGradient
Currently, this class is not exposed. I was thinking to expose it, but that can be also done in a separate PR.
    return_convergence_delta=True,
    attribute_to_layer_input=attribute_to_layer_input,
)
assertTensorAlmostEqual(self, attrs[0], expected, 0.005)
nit: Can change this to attrs and just obtain the tensor to be consistent with other layer methods?
n_samples = 10

# 10-class classification model
model = SoftmaxModel(num_in, 20, 10)
nit: Same here, weights in SoftmaxModel depend on the random initialization, which may not be consistent between versions, so expected attributions could change? Could possibly switch to a model with deterministic weights, e.g. BasicModel_MultiLayer or BasicModel_ConvNet_One_Conv?
This class is used in many places. We can set fixed weights for SoftmaxModel in a separate PR.
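The fixed-weights idea could later look something like the following (a hypothetical helper, not the SoftmaxModel API): fill the weight matrix with a deterministic sequence so expected attributions cannot drift with the framework's random initialization.

```python
def fixed_weight_matrix(n_out, n_in, start=0.1, step=0.05):
    """Hypothetical helper: deterministic weights laid out as an arithmetic
    sequence, row-major, so test expectations never depend on RNG state
    or framework version."""
    return [[start + step * (r * n_in + c) for c in range(n_in)]
            for r in range(n_out)]
```

With torch, the same idea would be assigning `layer.weight.data = torch.tensor(fixed_weight_matrix(...))` after constructing the model.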
if callable(baselines):
    baselines = baselines(inputs)

baselines = torch.mean(baselines[0], axis=0, keepdim=True)
I think it isn't necessarily true that averaging the baseline values and computing Neuron IG with respect to that should match NeuronGradientShap for any non-linear model. Could we instead possibly compute neuron IG attributions with respect to each baseline and average those? I think that should theoretically match NeuronGradientShap with sufficient samples (and small stdev), so hopefully the delta can be reduced with that test as well.
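The distinction can be seen on a toy nonlinear function. With a 1-D Riemann-sum IG (a hypothetical helper, not captum's API) and f(z) = z², IG computed against the mean baseline differs from the mean of per-baseline IG attributions, which is what GradientSHAP approximates:

```python
def integrated_gradients_1d(grad_f, x, baseline, n_steps=100):
    """Midpoint Riemann-sum IG for a scalar input:
    (x - baseline) * average gradient along the straight-line path."""
    grads = [grad_f(baseline + (k + 0.5) / n_steps * (x - baseline))
             for k in range(n_steps)]
    return (x - baseline) * sum(grads) / n_steps

grad_sq = lambda z: 2.0 * z          # gradient of f(z) = z**2
x, baselines = 1.0, [-2.0, 2.0]

# IG against the averaged baseline (mean baseline is 0.0): x**2 - 0**2 = 1.0
ig_of_mean = integrated_gradients_1d(grad_sq, x, sum(baselines) / len(baselines))
# Mean of per-baseline IG: mean over b of (x**2 - b**2) = 1.0 - 4.0 = -3.0
mean_of_ig = sum(integrated_gradients_1d(grad_sq, x, b)
                 for b in baselines) / len(baselines)
```

The two quantities disagree whenever the model is nonlinear in the input (Jensen's inequality), so averaging per-baseline IG attributions is the right comparison target for NeuronGradientShap.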
from captum.attr._core.neuron.neuron_integrated_gradients import (
    NeuronIntegratedGradients,
)
nit: For both neuron and layer, could potentially add a test with multiple input tensors / additional args to confirm these work fine?
tests/attr/test_data_parallel.py (Outdated)
    baselines=baselines,
    additional_forward_args=None,
    test_batches=False,
)
nit: Could potentially set alt_device_ids=True on some of these tests / add tests with it, since this verifies that things work appropriately with different device id orderings (essentially that device_ids are being passed appropriately).
Force-pushed from 6f866b8 to 8297a0f
@NarineK has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Summary: This PR adds Gradient SHAP for layer and neuron, with documentation and test cases, including tests for data parallel. It also moves the `_compute_conv_delta_and_format_attrs` function to `common.py` so that it can be used in both DeepLift and Gradient SHAP. Some of the auxiliary test functions are now shared between Gradient SHAP and DeepLift. Pull Request resolved: pytorch#175 Differential Revision: D18680948 Pulled By: NarineK fbshipit-source-id: 578c756db09e4069c422dca1d0fb2c360b19d950