PyTorch Implementation of Michael Jordan’s lab's Perturbed SGD?  #21988

@RylanSchaeffer

Description

There’s a line of work out of Michael Jordan’s lab regarding perturbed stochastic gradient descent that allegedly has advantages over SGD:

  • Gradient Descent Can Take Exponential Time to Escape Saddle Points
  • How to Escape Saddle Points Efficiently
  • Stochastic Gradient Descent Escapes Saddle Points Efficiently

Is there an implementation of Perturbed SGD in PyTorch as an optimizer? I looked through the available optimizers and the answer appears to be no.
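There does not appear to be one built in, but the core idea (ordinary SGD steps plus an occasional random perturbation of the iterate when the gradient is small, to escape saddle points) is simple to sketch as a custom optimizer. Below is a minimal, illustrative sketch only, not the exact algorithm from the papers: the hyperparameter names (`noise_radius`, `grad_threshold`, `perturb_interval`) are made up here, and it injects Gaussian noise, whereas the papers perturb uniformly within a small ball.

```python
# Sketch of a "perturbed SGD" optimizer: plain SGD steps, plus isotropic noise
# added to the parameters when the gradient norm is small (a possible saddle
# point) and no perturbation has been added recently. Hyperparameters and the
# Gaussian noise choice are illustrative assumptions, not taken from the papers.
import torch
from torch.optim import Optimizer


class PerturbedSGD(Optimizer):
    def __init__(self, params, lr=1e-2, noise_radius=1e-2,
                 grad_threshold=1e-3, perturb_interval=10):
        defaults = dict(lr=lr, noise_radius=noise_radius,
                        grad_threshold=grad_threshold,
                        perturb_interval=perturb_interval)
        super().__init__(params, defaults)
        # Allow a perturbation immediately if the first gradients are tiny.
        self._steps_since_perturb = perturb_interval

    @torch.no_grad()
    def step(self, closure=None):
        loss = None
        if closure is not None:
            with torch.enable_grad():
                loss = closure()

        # Global gradient norm, used to decide whether we may be near a saddle.
        grad_sq = 0.0
        for group in self.param_groups:
            for p in group["params"]:
                if p.grad is not None:
                    grad_sq += p.grad.pow(2).sum().item()
        grad_norm = grad_sq ** 0.5

        self._steps_since_perturb += 1
        perturbed = False

        for group in self.param_groups:
            lr = group["lr"]
            perturb = (grad_norm < group["grad_threshold"]
                       and self._steps_since_perturb >= group["perturb_interval"])
            for p in group["params"]:
                if p.grad is None:
                    continue
                # Plain SGD update.
                p.add_(p.grad, alpha=-lr)
                # Inject noise when the gradient is small, so the iterate can
                # move off flat regions / saddle points.
                if perturb:
                    p.add_(torch.randn_like(p), alpha=group["noise_radius"])
            perturbed = perturbed or perturb

        if perturbed:
            self._steps_since_perturb = 0
        return loss
```

It would drop in like any `torch.optim` optimizer, e.g. `PerturbedSGD(model.parameters(), lr=0.1)`, though the noise radius and gradient threshold would need tuning per problem.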

Metadata

Labels

  • enhancement — Not as big of a feature, but technically not a bug. Should be easy to fix.
  • module: optimizer — Related to torch.optim.
  • triaged — This issue has been looked at by a team member, and triaged and prioritized into an appropriate module.
