Pop-Art paper by DeepMind

About

Reproducing the example from the paper [1] (tribute).

In addition, we compare Normalized SGD to Pop-Art SGD; while the former uses gradient rescaling and the latter is based on rescaling weights, the two are equivalent in case of squared loss.

Run

To build the Docker image and run the example, use

make run

References

[1] Hasselt et al. Learning values across many orders of magnitude. NIPS 2016.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Pop-Art paper by DeepMind

About

Run

References

Files

README.md

Latest commit

History

README.md

File metadata and controls

Pop-Art paper by DeepMind

About

Run

References