Improving Rainbow: Improving Improvements in Deep Reinforcement Learning

by Alexander Ludwig & Sören Viegener

This is an implementation of the Rainbow reinforcement learning agent presented by Hessel et al. The implementation uses parallel asynchronous environments and has some extensions to the original Rainbow agent:

Different neural network architectures, namely the original DQN architecture, the Impala CNN, and D2RL
Different exploration strategies, namely epsilon-greedy, noisy nets, softmax exploration, and random network distillation

Running the Agent

To train the agent with the original Rainbow settings: (note that this requires a LOT of RAM. At least around 50 GB)

python main.py --log_wandb=False

Results

Some of the best episodes of the five games played by setup 3 can be watched on Youtube:
https://youtube.com/playlist?list=PLdeppp6CMwaRKorJJUJzSIcHffu37su-r

This was created as part of the Project Deep Reinforcement Learning at Ulm University

Name		Name	Last commit message	Last commit date
Latest commit History 137 Commits
configs		configs
.gitignore		.gitignore
README.md		README.md
agent.py		agent.py
env_utils.py		env_utils.py
env_wrappers.py		env_wrappers.py
gym_wrappers.py		gym_wrappers.py
loss_functions.py		loss_functions.py
main.py		main.py
model.py		model.py
replay_buffer.py		replay_buffer.py
reward_model.py		reward_model.py
train.py		train.py
util.py		util.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Improving Rainbow: Improving Improvements in Deep Reinforcement Learning

Running the Agent

Results

About

Releases

Packages

Languages

GL-Ludo/pdrl_rainbow

Folders and files

Latest commit

History

Repository files navigation

Improving Rainbow: Improving Improvements in Deep Reinforcement Learning

Running the Agent

Results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages