Official codebase for Rating-Based Reinforcement Learning. Rating-Based Reinforcement Learning is based on the B-Pref codebase which can be found here.
conda env create -f conda_env.yml
pip install -e .[docs,tests,extra]
cd custom_dmcontrol
pip install -e .
cd custom_dmc2gym
pip install -e .
pip install git+https://github.com/rlworkgroup/metaworld.git@master#egg=metaworld
pip install pybullet
Experiments for Walker can be reproduced by running the following command:
./scripts/walker_walk/1000/equal/run_PrefPPO.sh [n = 2, 3, 4, 5, 6]
Experiments for Quadruped can be reproduced by adjusting the reward threshold for specific rating classes and then running the following command:
./scripts/quadruped_walk/2000/equal/run_PrefPPO.sh [n = 2, 3, 4, 5, 6]
Experiments can be reproduced with the following:
./scripts/walker_walk/run_ppo.sh
./scripts/quadruped_walk/run_ppo.sh
@inproceedings{white2024rating,
title={Rating-Based Reinforcement Learning},
author={White, Devin and Wu, Mingkang and Novoseller, Ellen and Lawhern, Vernon J and Waytowich, Nicholas and Cao, Yongcan},
booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
volume={38},
number={9},
pages={10207--10215},
year={2024}
}