
Conversation

ProGamerGov
Contributor

See #579 for more details

ProGamerGov and others added 30 commits January 5, 2021 18:28

* Update test_atlas.py
* Vectorize heatmap function.

* Add sample count to vec coords.

* Add labels for inception v1 model.
Sample collection should now be faster & use less memory.
* Activation atlas visualization with 1.2 million samples
* Docs for the WhitenedNeuronDirection objective.
* Fix for visualizations being flipped horizontally.
ProGamerGov added 20 commits May 2, 2021 14:04
* Also changed the `self.direction` variable to `self.vec`.
* Improved a number of text cells in both atlas tutorials
* Add sections about atlas reproducibility.
* Give the user the option to see all class ids and their corresponding class names.
* Remove old code and text for slicing incomplete atlases.
* Use more precise language.
* Improve the flow of language used between steps.
* Hopefully sample collection is easier to understand this way, as it was previously added as a commented out section to the main activation atlas tutorial.
* Improved the description of activation and attributions samples in both visualizing atlases notebooks.
* Also improved the activation atlas sample collection tutorial.
* Move activation atlas tutorials to their own folder.
* Move activation atlas sample collection functions to the sample collection tutorial.
@NarineK
Contributor

NarineK commented Aug 13, 2021

Thank you for splitting up the PRs, @ProGamerGov. It looks like the new PRs show 95 other commits in the commit history. Is it possible to clean up the commit history, squash it into one commit, and have the longer commit history associated only with the original PR?

  1. For the notebook Collecting Samples for Activation Atlases with captum.optim, do you mind pointing to the corresponding notebook in Lucid? The activation atlas work has multiple notebooks, and it is unclear which one this notebook refers to.
  2. I'm not very clear on why "By default the activation samples will not have the right class attributions." What do we mean by attribution here? I think it would be great to define and describe it in the notebook.
  3. Are the attributions activations from the modified network? Why are we enabling gradients if we are computing only the activations?

with torch.set_grad_enabled(True):
    target_activ_attr_dict = opt.models.collect_activations(
        attr_model, attr_targets, inputs
    )

  4. I think you might want to describe how the attribution is computed using input * gradients, and why we are calling autograd twice. We need to describe the trick we are using here and reference the documentation; otherwise it's unclear when trying to understand it from the tutorial.
  5. It looks like the attribute_samples function saves the activation and attribution vectors in separate files. Wouldn't it be better to concatenate them in memory and save them all together in one file, instead of concatenating them later in the consolidate_samples function?
  6. In the consolidate_samples function, when you mention [n_channels, n_samples], do you mean the number of target classes by n_channels?
  7. I looked into the tutorial and tutorial-related changes, but it looks like there is a lot more code in this PR that is not used in the tutorial, and some of it is duplicated in PR Optim-wip: Add Class Activation Atlas tutorial #730. For example, AngledNeuronDirection is defined in both PRs. I think the changes that aren't related to this tutorial should live in one place, e.g. in PR #730.

@ProGamerGov
Contributor Author

@NarineK

  1. This is the equivalent Lucid notebook: https://colab.research.google.com/github/tensorflow/lucid/blob/master/notebooks/activation-atlas/activation-atlas-collect.ipynb

  2. I'll try to explain it better!

  3. We only need gradients when calculating the attributions, so for the sake of speed and efficiency I added the `with torch.set_grad_enabled(True)` lines to the attribution part only. If the user isn't interested in collecting attributions, then the code runs a lot faster. The attributions are collected from the modified model using the special pooling layers, while the activations are collected from the unmodified model (see the first sketch after this list).

  4. I'll add some references to the double-backwards trick, like in the Lucid notebook!

  5. In my testing, I found that dumping the collected activations and attributions to files increased the speed by a significant amount. When dealing with 1 million training images, it can slow to a crawl and potentially crash from out-of-memory errors if I keep everything in memory. Saving the batches as individual files also means that you still have usable data if it crashes at 99% (see the second sketch after this list).

  6. n_channels in the consolidate_samples docs refers to the number of output channels in the saved / loaded activation tensors, not the number of target classes. I should probably clarify this better!

  7. This PR was meant to be reviewed after Optim-wip: Add Activation Atlas tutorial & functions #579, so I kept the core code here as well. But I can remove the shared code. I may have to just make a new PR and close this one in order to clean up the commits.
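To make points 3 and 4 a bit more concrete, here is a minimal, self-contained sketch of the general idea: activations are collected with autograd disabled, while the attribution pass enables gradients and combines the layer's activation with its gradient. The toy model, layer choice, and variable names are illustrative rather than the tutorial's exact code, and the double-backward detail through the relaxed pooling layers is not reproduced here.

import torch
import torch.nn as nn

# Toy stand-ins for the real model and target layer.
model = nn.Sequential(
    nn.Conv2d(3, 8, 3, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(8, 10),
)
layer = model[1]
inputs = torch.randn(4, 3, 32, 32)

activations = {}

def hook(module, inp, out):
    activations["target"] = out
    if out.requires_grad:
        out.retain_grad()  # keep .grad on this intermediate tensor

handle = layer.register_forward_hook(hook)

# Activation collection: no gradients are needed, so no_grad saves time and memory.
with torch.no_grad():
    model(inputs)
target_activ = activations["target"]

# Attribution collection: gradients must be enabled for the backward call.
with torch.set_grad_enabled(True):
    logits = model(inputs)
    logits[:, 0].sum().backward()  # backprop a chosen class logit

# A simple activation * gradient attribution at the target layer.
target_attr = activations["target"] * activations["target"].grad
handle.remove()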
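For points 5 and 6, here is a rough sketch of the save-per-batch-then-consolidate approach described above; the file layout, dict keys, and function names are assumptions, not the actual attribute_samples / consolidate_samples code.

import glob
import os
import torch

def save_batch(activ: torch.Tensor, attr: torch.Tensor, batch_idx: int, out_dir: str = "samples") -> None:
    # One small file per batch keeps peak memory low on million-image runs and
    # preserves completed work if the job crashes partway through.
    os.makedirs(out_dir, exist_ok=True)
    torch.save(
        {"activations": activ.cpu(), "attributions": attr.cpu()},
        os.path.join(out_dir, f"batch_{batch_idx:06d}.pt"),
    )

def consolidate(out_dir: str = "samples"):
    # Concatenate the per-batch tensors along the sample dimension. Here
    # "channels" means the target layer's output channels, not class ids.
    chunks = [torch.load(f) for f in sorted(glob.glob(os.path.join(out_dir, "batch_*.pt")))]
    activations = torch.cat([c["activations"] for c in chunks], dim=0)
    attributions = torch.cat([c["attributions"] for c in chunks], dim=0)
    return activations, attributions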

A relaxed pooling layer that is useful for calculating attributions of spatial
positions. This layer reduces noise in the gradient through the use of a
continuous relaxation of the gradient.

Contributor


Can we perhaps cite it with a reference to the code in Lucid? The gradient of what? Of the output with respect to this layer?

)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.maxpool(x.detach()) + self.avgpool(x) - self.avgpool(x.detach())
Contributor


This part is a bit unclear. We add avgpool of the input and then subtract avgpool of the detached input?

Contributor Author

@ProGamerGov ProGamerGov Sep 6, 2021


@NarineK For this layer to work correctly, we want the gradient of the input passed through nn.AvgPool2d, while also using the tensor values from the input passed through nn.MaxPool2d. This is what the line does:

max_input_grad_avg = self.maxpool(x.detach()) + self.avgpool(x) - self.avgpool(x.detach())

As I couldn't seem to separately modify the gradient of the input in the forward pass, this was the solution that Chris came up with.
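Roughly, the layer looks like this; a sketch assuming 2x2 pooling, since the constructor arguments shown here are illustrative and the actual class in the PR may differ:

import torch
import torch.nn as nn

class MaxPool2dRelaxed(nn.Module):
    def __init__(self, kernel_size: int = 2, stride: int = 2) -> None:
        super().__init__()
        self.maxpool = nn.MaxPool2d(kernel_size, stride)
        self.avgpool = nn.AvgPool2d(kernel_size, stride)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Forward value: maxpool(x), detached so no gradient flows through it.
        # Backward path: avgpool(x) is the only grad-carrying term, and subtracting
        # avgpool(x.detach()) cancels its value contribution without cancelling its gradient.
        return self.maxpool(x.detach()) + self.avgpool(x) - self.avgpool(x.detach())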

Contributor


@ProGamerGov, thank you for the explanation. Does this mean that you want the forward pass to return self.maxpool(x.detach()), but the backward pass to be computed through self.avgpool(x)?
I think it would be great to document this, since from a certain perspective self.avgpool(x) - self.avgpool(x.detach()) = 0, so it is not obvious why we need it.

Contributor Author

@ProGamerGov ProGamerGov Sep 24, 2021


@NarineK Yes, we want the forward pass to return self.maxpool(x.detach()) while the backward pass uses self.avgpool(x). Lucid uses TensorFlow's gradient override system to perform the same task with slightly different methodology.

The Activation Atlas paper references this Lucid attribution notebook as the place where the idea was first used. In the notebook, we see that the Lucid version of MaxPool2dRelaxed has the following description:

Construct a pooling function where, if we backprop through it, gradients get allocated proportional to the input activation. Then backprop through that instead.

In some ways, this is kind of spiritually similar to SmoothGrad (Smilkov et al.). To see the connection, note that MaxPooling introduces a pretty arbitrary discontinuity to your gradient; with the right distribution of input noise to the MaxPool op, you'd probably smooth out to this. It seems like this is one of the most natural ways to smooth.
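As a quick illustration (using the MaxPool2dRelaxed sketch above, not code from the tutorial), the output matches plain max pooling while the gradient is routed through average pooling, so the avgpool terms only cancel value-wise, not gradient-wise:

import torch
import torch.nn as nn

x = torch.randn(1, 1, 4, 4, requires_grad=True)
relaxed = MaxPool2dRelaxed(kernel_size=2, stride=2)

out = relaxed(x)
assert torch.allclose(out, nn.MaxPool2d(2)(x))  # values come from max pooling

out.sum().backward()
# Every input element receives 1/4 of the gradient, exactly as average pooling would give.
assert torch.allclose(x.grad, torch.full_like(x, 0.25))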

Contributor Author


Chris helped me a fair bit with recreating the algorithm in PyTorch, so I may not be explaining how it works correctly. I can mark it down that we need to come back to improve the description in a future PR if you want.

Contributor


@ProGamerGov, thank you for the explanation. Do you mind adding this description in the code so that in the future we can understand it easily?

Contributor Author


@NarineK Okay, I've updated the class's description!

@NarineK
Contributor

NarineK commented Aug 16, 2021

@ProGamerGov, thank you for the replies. Let me know if you create a new PR with cleaned commit history. I'll take a look into it.

@ProGamerGov
Contributor Author

@NarineK Closing this PR in favor of: #750

@ProGamerGov ProGamerGov closed this Sep 8, 2021