computing attention scores using relative attention bias #1832

pmabbo13 · 2022-07-13T17:24:21Z

Stack from ghstack (oldest at bottom):

WIP PR to workshop implementation: #1812

[ghstack-poisoned]

torchtext/prototype/t5/modules.py

WIP PR to workshop implementation: #1812 [ghstack-poisoned]

pmabbo13 · 2022-07-15T15:24:47Z

Description

Having computed the relative attention bias term, this method computes the attention scores. The implementation is very similar to the nn.Functional._scaled_dot_product_attention, expect that we pass in position_bias as an input argument so that relative attention bias can be incorporated in the computation of the attention scores.

Since the input tensors to this function are 4-dimensional, we replace the torch.baddbmm and torch.bmm with torch.matmul. Since we are no longer using torch.baddbmm where attn_mask was passed as an input argument, we instead add the attn_mask directly to position_bias to ensure the mask is still applied before the softmax is taken to get the final attention scores.

parmeet

LGTM!

WIP PR to workshop implementation: #1812 [ghstack-poisoned]

computing attention scores using relative attention bias

3d5879c

[ghstack-poisoned]

facebook-github-bot added the cla signed label Jul 13, 2022

pmabbo13 added 2 commits July 13, 2022 13:40

Update on "computing attention scores using relative attention bias"

7028f1b

[ghstack-poisoned]

Update on "computing attention scores using relative attention bias"

026d4d6

[ghstack-poisoned]

pmabbo13 requested a review from Nayef211 July 13, 2022 18:15

Nayef211 approved these changes Jul 13, 2022

View reviewed changes

torchtext/prototype/t5/modules.py Outdated Show resolved Hide resolved

pmabbo13 mentioned this pull request Jul 14, 2022

Add T5 Model and Demo on Text Summarization using CNNDM Dataset #1800

Closed

25 tasks

Update on "computing attention scores using relative attention bias"

29d73f2

WIP PR to workshop implementation: #1812 [ghstack-poisoned]

pmabbo13 requested review from abhinavarora and parmeet July 15, 2022 15:57

parmeet approved these changes Jul 15, 2022

View reviewed changes

pmabbo13 added 2 commits July 15, 2022 16:58

Update on "computing attention scores using relative attention bias"

a9be77b

WIP PR to workshop implementation: #1812 [ghstack-poisoned]

Update on "computing attention scores using relative attention bias"

95dd347

WIP PR to workshop implementation: #1812 [ghstack-poisoned]

pmabbo13 merged commit 9ed4314 into gh/pmabbo13/12/base Jul 18, 2022

facebook-github-bot deleted the gh/pmabbo13/12/head branch August 18, 2022 14:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

computing attention scores using relative attention bias #1832

computing attention scores using relative attention bias #1832

Uh oh!

pmabbo13 commented Jul 13, 2022 •

edited

Loading

Uh oh!

Uh oh!

pmabbo13 commented Jul 15, 2022

Uh oh!

parmeet left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

computing attention scores using relative attention bias #1832

computing attention scores using relative attention bias #1832

Uh oh!

Conversation

pmabbo13 commented Jul 13, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

pmabbo13 commented Jul 15, 2022

Description

Uh oh!

parmeet left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

pmabbo13 commented Jul 13, 2022 •

edited

Loading