Decoupled weight decay #164
Conversation
@PhilJd Welcome, thanks for the PR, and your patience. I'll take a look later this week :-)

You probably need to add this into …

Thanks! I've added the respective classes and functions to …
facaiy left a comment
Thanks for the PR, I'll take another look at the weekend :-)
On `if __name__ == "__main__":`
Do you need it?
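(Context for this review question: TensorFlow test files conventionally end with a `__main__` guard that invokes the test runner, so the file can be executed directly as well as collected by a larger suite. A generic stand-alone sketch of the pattern using the standard library's `unittest`; the class and test names here are hypothetical, and a real TensorFlow test file would call `tf.test.main()` instead:)

```python
import unittest


class OptimizerSmokeTest(unittest.TestCase):
    # Hypothetical placeholder; a real optimizer test file would
    # exercise optimizer behaviour here.
    def test_placeholder(self):
        self.assertEqual(1 + 1, 2)


if __name__ == "__main__":
    # Run the tests only when the file is executed directly, not when it
    # is imported. A plain test file would just call `unittest.main()`;
    # exit=False and an explicit argv merely keep this snippet embeddable.
    unittest.main(argv=["optimizer_test"], exit=False)
```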
Thanks for the comments @facaiy, I've updated the PR!
facaiy left a comment
Nice work! I've left some questions regarding TF 2.0.
@PhilJd Phil, don't forget to run …

@PhilJd Hi, Phil. Could you address all comments? Thank you for the high quality PR, can't wait to merge it :-)
…izer tests, optimizer params are now keywords instead of a dict. Fix code in comments to support tf-2.0, naming errors, line length.
@facaiy Thanks a lot for the comments! :)
By the way, I ran …
facaiy left a comment
very close :-)
factory function.
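(Aside, for readers following along: the factory function discussed here, in the eventual tensorflow_addons API, takes a base optimizer class and returns a variant that additionally applies decoupled weight decay. A stripped-down pure-Python sketch of that class-factory pattern; the names `DecoupledWeightDecay`, `apply_update`, and the `PlainSGD` base are illustrative inventions, not the PR's actual code:)

```python
def extend_with_decoupled_weight_decay(base_cls):
    """Return a subclass of `base_cls` that also applies decoupled weight decay.

    Each step the weight is additionally shrunk by lr * weight_decay * w,
    independently of the base optimizer's gradient-based update.
    """
    class DecoupledWeightDecay(base_cls):
        def __init__(self, weight_decay, **kwargs):
            super().__init__(**kwargs)
            self.weight_decay = weight_decay

        def apply_update(self, w, grad):
            w = super().apply_update(w, grad)  # base optimizer's update
            return w - self.lr * self.weight_decay * w  # decoupled decay

    return DecoupledWeightDecay


class PlainSGD:
    """Minimal stand-in for a base optimizer (illustrative only)."""

    def __init__(self, lr=0.1):
        self.lr = lr

    def apply_update(self, w, grad):
        return w - self.lr * grad


# The factory turns any base optimizer into its "W" variant:
SGDW = extend_with_decoupled_weight_decay(PlainSGD)
```

Applied to an Adam-like base class, the same factory would yield an AdamW-style optimizer, which is the point of exposing it alongside the concrete SGDW/AdamW classes.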
Oops, I forgot to commit …
Looks great, thank you, PhilJ! Could you resolve the merge conflict with the master branch? By the way, you can put your name in the contact info list and the code owners file if you'd like to maintain the module you contributed.

@seanpmorgan @WindQAQ Sean, Tzu-Wei, do you have any concerns about this change?
No, it looks like a very nice PR. It just needs the conflicts resolved and tests passing, IMO.
Conflicts with the master are resolved and I've put my name as maintainer into the README ;)

Interesting, running …
Please ping me when you get it done, and I'll merge it. Thanks for your patience :-)

I've applied the patch from the build server. I wasn't able to find out why the code-format check doesn't complain locally. I've tried the clang-format version the build task uses, …
facaiy left a comment
Thanks!
This PR ports the decoupled weight decay optimizers (SGDW, AdamW) to TensorFlow 2.0, minus all v1 tests, as tensorflow_addons depends on TF 2.0 anyway.
Note that I factored out the testing code that is duplicated in most optimizer tests (also in base TensorFlow) into optimizer_test_base. After this PR has been merged, I'd adapt the LazyAdam optimizer to inherit from OptimizerTestBase. Sorry for the long delay!
Cheers,
Phil :)
Closes #24.
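(Background for readers new to the technique: decoupled weight decay, from Loshchilov & Hutter's "Decoupled Weight Decay Regularization", applies the decay directly to the weights instead of folding an L2 penalty into the gradient. For plain SGD the two are mathematically equivalent, but for adaptive optimizers like Adam they differ, because the L2 term gets rescaled by the adaptive denominator while the decoupled term does not. A minimal scalar sketch of one step; this is my own illustrative code, not the PR's implementation:)

```python
def adam_step(w, grad, t, lr=0.1, b1=0.9, b2=0.999, eps=1e-8, wd=0.0,
              decoupled=False):
    """One bias-corrected Adam step starting from zero moments.

    With decoupled=False, wd acts as classic L2 regularization (added to
    the gradient). With decoupled=True, wd decays the weight directly,
    as in AdamW.
    """
    if not decoupled:
        grad = grad + wd * w            # L2: decay folded into the gradient
    m = (1 - b1) * grad                 # first moment (m0 = 0)
    v = (1 - b2) * grad * grad          # second moment (v0 = 0)
    m_hat = m / (1 - b1 ** t)           # bias correction
    v_hat = v / (1 - b2 ** t)
    update = lr * m_hat / (v_hat ** 0.5 + eps)
    if decoupled:
        update += lr * wd * w           # AdamW: decay term on the weight itself
    return w - update
```

With these numbers the L2 variant moves the weight by roughly lr·sign(grad) regardless of wd (the decay is swallowed by the adaptive denominator), while the decoupled variant subtracts the full lr·wd·w on top; that is the behavioural difference the paper motivates.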