
Conversation

@guillaumekln (Contributor)

This fixes using a custom attention layer, which previously required the AttentionMechanism instances to be initialized with a memory at the time the AttentionWrapper was created.

Fixes #461.
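A minimal sketch of the usage this change enables, assuming the tfa.seq2seq API of TensorFlow Addons at the time of this PR; the cell, layer sizes, and shapes are illustrative only and not taken from the PR itself:

```python
import tensorflow as tf
import tensorflow_addons as tfa

units = 128  # illustrative size

# Create the attention mechanism without a memory; the memory is set later.
attention_mechanism = tfa.seq2seq.LuongAttention(units)

# Previously, passing a custom attention_layer here failed because the wrapper
# needed the memory to compute the attention layer's output size at
# construction time. With this change the size is computed lazily.
cell = tfa.seq2seq.AttentionWrapper(
    tf.keras.layers.LSTMCell(units),
    attention_mechanism,
    attention_layer=tf.keras.layers.Dense(units, use_bias=False),
)

# Attach the memory once the encoder output is available.
memory = tf.random.normal([4, 10, units])
attention_mechanism.setup_memory(memory)
```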

@guillaumekln changed the title from "Lazily the compute attention layer size" to "Lazily compute the attention layer size" on Sep 9, 2019
@qlzh727 previously approved these changes on Sep 9, 2019
@qlzh727 (Member) left a comment:

I am OK with this change to support custom attention. On the other hand, it might introduce some latency, since the value is now computed on the fly multiple times. Can we cache the value once after it is calculated?
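A sketch of the caching pattern suggested here, using a hypothetical class and attribute names rather than the actual AttentionWrapper code: the sizes are computed on first access and the cached value is reused afterwards.

```python
import tensorflow as tf


class LazySizeCache:
    """Illustrative only: computes the attention layer output sizes lazily
    and caches the result so repeated accesses do not recompute it."""

    def __init__(self, attention_layers, memory_depth):
        self._attention_layers = attention_layers
        self._memory_depth = memory_depth
        self._cached_sizes = None  # hypothetical cache attribute

    @property
    def attention_layer_sizes(self):
        if self._cached_sizes is None:
            # Computed once, on first access, instead of at construction time.
            self._cached_sizes = [
                int(layer.compute_output_shape([None, self._memory_depth])[-1])
                for layer in self._attention_layers
            ]
        return self._cached_sizes


# The first access computes the sizes; later accesses reuse the cache.
layers = [tf.keras.layers.Dense(64), tf.keras.layers.Dense(32)]
cache = LazySizeCache(layers, memory_depth=128)
print(cache.attention_layer_sizes)  # [64, 32]
```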

@seanpmorgan merged commit 6633c43 into tensorflow:master on Sep 10, 2019
@guillaumekln deleted the lazy-attention-layer-size branch on June 9, 2020 at 08:16

Development

Successfully merging this pull request may close these issues.

Setting a custom attention_layer fails with AttentionMechanism without memory

4 participants