Entropic Desired Dynamics for Intrinsic ConTrol (EDDICT), a self-contained JAX implementation

This is a simplified version of the code used in the EDDICT paper (to appear at NeurIPS 2021). In this stand-alone Google Colab, EDDICT is trained on a continuous grid world with an uncontrollable distractor. The learned latent representations yield an interpretable model of the controllable aspects of the environment (i.e. the agent's (x, y) coordinates) while remaining invariant to the uncontrollable aspects (i.e. the distractor's (x, y) coordinates).
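To make the setup concrete, here is a minimal JAX sketch of what a continuous grid world with an uncontrollable distractor could look like. It is not the notebook's environment: the function names (`reset`, `step`), the constants, the 4-dimensional state layout, and the random-walk distractor are all assumptions made for illustration.

```python
import jax
import jax.numpy as jnp

# Hypothetical sketch, not the notebook's code: names and constants are assumptions.
STEP_SIZE = 0.1    # assumed scale of one agent move
NOISE_SCALE = 0.1  # assumed scale of the distractor's random walk


def reset(key):
    """Sample agent and distractor positions uniformly in the unit square."""
    agent_key, distractor_key = jax.random.split(key)
    agent_xy = jax.random.uniform(agent_key, (2,))
    distractor_xy = jax.random.uniform(distractor_key, (2,))
    # Assumed state layout: [agent_x, agent_y, distractor_x, distractor_y].
    return jnp.concatenate([agent_xy, distractor_xy])


@jax.jit
def step(key, state, action):
    """Advance one step; `action` is a 2-D displacement with entries in [-1, 1]."""
    agent_xy, distractor_xy = state[:2], state[2:]
    # Controllable part: the action moves the agent, clipped to the unit square.
    agent_xy = jnp.clip(agent_xy + STEP_SIZE * jnp.clip(action, -1.0, 1.0), 0.0, 1.0)
    # Uncontrollable part: the distractor follows a random walk that ignores the action.
    noise = NOISE_SCALE * jax.random.normal(key, (2,))
    distractor_xy = jnp.clip(distractor_xy + noise, 0.0, 1.0)
    return jnp.concatenate([agent_xy, distractor_xy])


reset_key, step_key = jax.random.split(jax.random.PRNGKey(0))
state = reset(reset_key)
next_state = step(step_key, state, jnp.array([1.0, 0.0]))
```

In this sketch the last two state entries evolve identically whatever action is taken, which is the sense in which the distractor is uncontrollable. A representation that is invariant to the distractor would ignore those entries.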

Installation

Open colab_demo.ipynb in Google Colab and run the cells in order. Any runtime should work, but a GPU considerably speeds up training.
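If you are unsure which runtime the Colab session is using, a quick check such as the following can be run in a cell. This snippet is not part of the notebook; it only uses the standard `jax.default_backend()` and `jax.devices()` calls.

```python
import jax

# Report the backend JAX selected for this session (for example "cpu" or "gpu").
backend = jax.default_backend()
print(f"JAX backend: {backend}")

# List the devices JAX can see on that backend.
print(f"Devices: {jax.devices()}")
```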

Usage

Run the cells in order to train an EDDICT agent from scratch and visualize its representations. You can also modify the environment (e.g. add walls) or ablate the agent (e.g. what if desired z = delta?), then rerun to see what happens.
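As one illustration of the "add walls" suggestion, the sketch below shows a possible way to block agent moves that end inside a rectangular wall. It does not reflect how the notebook's environment is written: `WALL`, `inside_wall`, and `move_with_wall` are hypothetical names, and the unit-square bounds are an assumption.

```python
import jax.numpy as jnp

# Hypothetical wall: an axis-aligned rectangle (x_min, y_min, x_max, y_max).
WALL = jnp.array([0.45, 0.0, 0.55, 0.7])


def inside_wall(xy, wall=WALL):
    """Return True if the point lies within the wall rectangle."""
    return jnp.all((xy >= wall[:2]) & (xy <= wall[2:]))


def move_with_wall(agent_xy, displacement):
    """Apply a displacement, keeping the old position if the new one is in the wall."""
    proposed = jnp.clip(agent_xy + displacement, 0.0, 1.0)
    return jnp.where(inside_wall(proposed), agent_xy, proposed)


blocked = move_with_wall(jnp.array([0.4, 0.3]), jnp.array([0.1, 0.0]))
free = move_with_wall(jnp.array([0.4, 0.9]), jnp.array([0.1, 0.0]))
```

This check only tests the end point of a move, so a displacement larger than the wall's thickness could jump over it. A more careful version would test the whole segment between the old and new positions.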

Citing this work

BibTeX for citing the EDDICT paper:

@article{hansen2021entropic,
  title={Entropic Desired Dynamics for Intrinsic Control},
  author={Hansen, Steven and Desjardins, Guillaume and Baumli, Kate and Warde-Farley, David and Heess, Nicolas and Osindero, Simon and Mnih, Volodymyr},
  journal={Advances in Neural Information Processing Systems},
  volume={34},
  year={2021}
}

Disclaimer

This is not an official Google product.
