Trust Region Policy Optimization (TRPO) algorithm implementation with TensorFlow 2 framework. Project developed for Reinforcement Learning exam of professor R. Capobianco.
- Install anaconda
- Create the environment:
$ conda create --name trpo python==3.7
- Activate the environment:
$ conda activate trpo
- Install requirements:
pip install gym[all]