Deep Reinforcement Learning for End-to-End Maze Navigation
ORCS E4529 - Reinforcement Learning Project - Fall 2023
Team members:
Aymeric Degroote
Pedro Leandro La Rotta
Kunal Kundu
Instructor: Shipra Agrawal
TODO: CNN with action output; feed that action into an LSTM with hidden_size = 100 and output the action directly from the LSTM.
- Use transfer learning on the first CNN: we know it works.
- The LSTM only acts on the action, which gives better interpretability.
- We can even try to freeze the CNN weights (the CNN stays agnostic to the past).
Remark: "action" here is actually an "action distribution".
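A minimal PyTorch sketch of this idea, assuming image observations and a placeholder CNN; apart from hidden_size = 100 and the CNN -> LSTM wiring described above, all layer sizes and names are assumptions, not the project's actual network.

    import torch
    import torch.nn as nn

    class CNNLSTMPolicy(nn.Module):
        """Sketch: the CNN outputs an action distribution; the LSTM sees only
        that distribution (never raw pixels) and outputs the final one."""

        def __init__(self, n_actions, hidden_size=100):
            super().__init__()
            # Placeholder CNN; per the TODO, this would be transferred from
            # the pretrained model and can optionally be frozen.
            self.cnn = nn.Sequential(
                nn.Conv2d(3, 16, 5, stride=2), nn.ReLU(),
                nn.Conv2d(16, 32, 5, stride=2), nn.ReLU(),
                nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                nn.Linear(32, n_actions),
            )
            self.lstm = nn.LSTM(n_actions, hidden_size, batch_first=True)
            self.head = nn.Linear(hidden_size, n_actions)

        def forward(self, obs, hidden=None):
            # obs: (batch, 3, H, W) single frame
            cnn_dist = torch.softmax(self.cnn(obs), dim=-1)   # action distribution
            out, hidden = self.lstm(cnn_dist.unsqueeze(1), hidden)
            final_dist = torch.softmax(self.head(out[:, -1]), dim=-1)
            return final_dist, hidden

    # Freezing the transferred CNN so it stays agnostic to the past:
    # for p in model.cnn.parameters():
    #     p.requires_grad = False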
To train a maze-agnostic model using REINFORCE on MiniWorld:
python3 reinforce_runs/miniworld-classic-train-agnostic.py
or
python3 reinforce_runs/miniworld-maml-train-agnostic.py
in the terminal.
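The classic script trains a single policy, while the maml script meta-trains it. The repo's exact MAML implementation is not shown here, so the following is only a hedged first-order (Reptile-style) sketch of meta-training across mazes, where make_maze_task and reinforce_update are hypothetical helpers rather than functions from this repo:

    import copy
    import torch

    def meta_train(policy, make_maze_task, reinforce_update,
                   meta_steps=1000, inner_steps=5, meta_lr=0.1):
        """Reptile-style first-order meta-training: adapt a copy of the
        policy on one sampled maze, then nudge the meta-parameters toward
        the adapted parameters."""
        for _ in range(meta_steps):
            task = make_maze_task()              # sample a random maze (hypothetical)
            adapted = copy.deepcopy(policy)      # inner-loop copy
            for _ in range(inner_steps):
                reinforce_update(adapted, task)  # a few REINFORCE steps (hypothetical)
            with torch.no_grad():
                for p, q in zip(policy.parameters(), adapted.parameters()):
                    p.add_(meta_lr * (q - p))    # move toward adapted weights
        return policy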
To fine-tune a maze-agnostic model on several mazes and assess its performance:
python3 reinforce_runs/miniworld-finetune-agnostic.py
in the terminal.
Replace 'miniworld' with 'minigrid' in the script paths above for the MiniGrid equivalents.
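For reference, the core update these training scripts perform is vanilla REINFORCE. A self-contained sketch, assuming a gymnasium-style env and a policy that maps an image tensor to a probability vector over actions (both assumptions about the repo's setup):

    import gymnasium as gym
    import torch

    def run_episode(env, policy):
        """Roll out one episode, collecting log-probs and rewards."""
        obs, _ = env.reset()
        log_probs, rewards, done = [], [], False
        while not done:
            obs_t = (torch.as_tensor(obs, dtype=torch.float32)
                     .permute(2, 0, 1).unsqueeze(0))          # HWC -> 1CHW
            dist = torch.distributions.Categorical(probs=policy(obs_t).squeeze(0))
            action = dist.sample()
            obs, reward, terminated, truncated, _ = env.step(action.item())
            done = terminated or truncated
            log_probs.append(dist.log_prob(action))
            rewards.append(reward)
        return log_probs, rewards

    def reinforce_step(env, policy, optimizer, gamma=0.99):
        log_probs, rewards = run_episode(env, policy)
        returns, g = [], 0.0
        for r in reversed(rewards):              # discounted returns, computed backward
            g = r + gamma * g
            returns.insert(0, g)
        returns = torch.tensor(returns)
        loss = -(torch.stack(log_probs) * returns).sum()  # REINFORCE objective
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

    # Usage sketch (env id is an assumption):
    # import miniworld  # registers MiniWorld envs with gymnasium
    # env = gym.make("MiniWorld-Maze-v0")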
To train REINFORCE on MiniGrid:
python3 train-minigrid.py
in the terminal.
To evaluate REINFORCE on MiniGrid:
python3 run-minigrid.py
in the terminal.
To watch REINFORCE play on MiniGrid with on-screen (human) rendering:
python3 run-minigrid.py human
in the terminal.
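The extra human argument presumably just switches on on-screen rendering; in gymnasium this is done via render_mode. A minimal sketch of that pattern, with a random policy standing in for the trained agent and an assumed env id:

    import sys
    import gymnasium as gym
    import minigrid  # noqa: F401  (importing registers MiniGrid envs)

    # Env id is an assumption; the repo's script may use a different maze.
    render_mode = "human" if "human" in sys.argv[1:] else None
    env = gym.make("MiniGrid-Empty-8x8-v0", render_mode=render_mode)

    obs, _ = env.reset()
    done = False
    while not done:
        action = env.action_space.sample()   # stand-in for the trained policy
        obs, reward, terminated, truncated, _ = env.step(action)
        done = terminated or truncated
    env.close()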
To try the maze Gym environment directly, run:
python3 maze_gym.py
in the terminal.
When training REINFORCE with a value-function approach, we edited two files directly in the miniworld library; we did not find it relevant to add them to this repo. The two files are:
- manual_control.py
- miniworld_control.py
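For context, a value-function approach here means subtracting a learned baseline V(s) from the return, which lowers the variance of the REINFORCE gradient. A minimal sketch of the per-episode loss, assuming the log-probs, critic predictions, and rewards were collected as in the training sketch above; the equal weighting of the two loss terms is an assumption:

    import torch

    def reinforce_with_baseline_loss(log_probs, values, rewards, gamma=0.99):
        """log_probs: list of scalar log pi(a_t|s_t); values: list of (1,)-shaped
        V(s_t) predictions from a critic; rewards: list of floats."""
        returns, g = [], 0.0
        for r in reversed(rewards):
            g = r + gamma * g
            returns.insert(0, g)
        returns = torch.tensor(returns)
        values = torch.stack(values).squeeze(-1)
        advantages = returns - values.detach()     # baseline reduces variance
        policy_loss = -(torch.stack(log_probs) * advantages).sum()
        value_loss = torch.nn.functional.mse_loss(values, returns)  # fit V to returns
        return policy_loss + value_loss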