This is a simplified version of the code used in the EDDICT paper (to appear at NeurIPS 2021). In this stand-alone Google Colab, EDDICT is trained on a continuous grid world with an uncontrollable distractor. The resulting latent representations can then be seen to yield an interpretable model of the controllable aspects of the environment (i.e. the (x,y) coordinates) while being invariant to the uncontrollable aspects (i.e. the distractor (x,y) coordinates).
Simply open the file in Google Colab
and run the cells in order. Any runtime should work, but a GPU considerably speeds up training.
Run the cells in order to train an EDDICT agent from scratch and visualize its representations. You can also try modifying the environment (e.g. add walls), or ablate the agent (e.g. what if desired z = delta?), and rerun to see what happens.
BibTex for citing the EDDICT paper:
@article{hansen2021entropic,
title={Entropic Desired Dynamics for Intrinsic Control},
author={Hansen, Steven and Desjardins, Guillaume and Baumli, Kate and Warde-Farley, David and Heess, Nicolas and Osindero, Simon and Mnih, Volodymyr},
journal={Advances in Neural Information Processing Systems},
volume={34},
year={2021}
}
This is not an official Google product.