This repository is part of a group project for the module COMP0124 Multi-agent Artificial Intelligence (2023/24) at University College London. The project expands upon the value-based tie-breaking mechanism introduced in the paper "SCRIMP: Scalable Communication for Reinforcement- and Imitation-Learning-Based Multi-Agent Pathfinding" (Wang et al., 2023). This repository was forked from the authors' SCRIMP repository and contains modifications to the original code to support our analysis.
- Add a conda environment file for Python 3.9.
- Rename `driver.py` to `train_model.py`.
- Add `multi_train.py` for training multiple models consecutively.
- Add an argument parser to the model training and evaluation scripts.
- Upload the final model to wandb automatically.
- Log model evaluation results to a wandb table.
- Add a block factor and a congestion factor to the probability construction in the tie-breaking mechanism.
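The last change can be illustrated with a small sketch. Everything below (function and parameter names, the softmax weighting) is an illustrative assumption, not the repository's actual implementation:

```python
import math

def tie_break_probs(values, block_factors, congestion_factors,
                    w_block=1.0, w_congest=1.0):
    """Hypothetical sketch: turn per-agent state values into tie-breaking
    probabilities, mixing in a block factor (how much an agent blocks
    others) and a congestion factor (local crowding around the agent)."""
    scores = [v + w_block * b + w_congest * c
              for v, b, c in zip(values, block_factors, congestion_factors)]
    # Softmax over the combined scores yields one probability per agent.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]
```

Under this sketch, an agent with a higher combined score wins ties more often, while the softmax keeps the choice stochastic.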
- Install `python==3.9` and the project dependencies
  - Using conda:
    ```shell
    $ conda env create -f environment.yml
    $ conda activate maai
    ```
  - Or using pip:
    ```shell
    $ pip install -r requirement.txt
    ```
- Set up the OdrM* package
  - Build the package:
    ```shell
    $ cd od_mstar3
    $ python setup.py build_ext --inplace
    $ cd ..
    ```
  - Test the build:
    ```shell
    $ python
    >>> import od_mstar3.cpp_mstar  # should import without any error
    ```
- Set up wandb for real-time training monitoring and evaluation results
  - Register an account at https://wandb.ai/site
  - Log in to wandb on the machine:
    ```shell
    $ conda activate maai  # make sure the environment is active
    $ wandb login          # then follow the instructions
    ```
- Train a single model
  - Set the training parameters in `alg_parameters.py`.
  - Run the single-model training script:
    ```shell
    $ python train_model.py
    ```
  - Trained models are stored in the corresponding experiment directory under `models/MAPF/` as `net_checkpoint.pkl`, and are uploaded to wandb if `RecordingParameters.wandb` is set to `True`.
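As a rough illustration, the settings in `alg_parameters.py` are grouped into parameter classes. The sketch below is an assumption about their shape: only `RecordingParameters.wandb` is named in this README, and every other field and value is invented for illustration:

```python
# Hypothetical sketch of the parameter classes in alg_parameters.py.
# Only RecordingParameters.wandb is referenced by this README; all
# other names and values here are illustrative assumptions.
class TrainingParameters:
    lr = 1e-5      # learning rate for PPO updates
    gamma = 0.95   # discount factor
    n_envs = 16    # number of parallel runner processes

class RecordingParameters:
    wandb = True   # upload checkpoints and metrics to wandb
    experiment_name = "expt1"
```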
- Train multiple models
  - Set multiple training configurations via `CONFIG_SETS` in `multi_train.py`.
  - Run the multi-training script to train the models one by one:
    ```shell
    $ python multi_train.py
    ```
  - Trained models are stored in the corresponding experiment directories under `models/MAPF/` as `net_checkpoint.pkl`, and are uploaded to wandb if `RecordingParameters.wandb` is set to `True`.
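The exact shape of `CONFIG_SETS` is not documented here; a plausible sketch, in which each entry is a set of parameter overrides and models are trained back to back (all names below are assumptions):

```python
# Hypothetical sketch of CONFIG_SETS in multi_train.py: one dict of
# parameter overrides per model, applied consecutively.
CONFIG_SETS = [
    {"experiment_name": "expt1", "lr": 1e-5},
    {"experiment_name": "expt2", "lr": 5e-6},
]

def train_all(config_sets, train_fn):
    """Train one model per configuration, one after another."""
    results = []
    for cfg in config_sets:
        results.append(train_fn(cfg))
    return results
```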
- Evaluate a single model
  - Locate the model's path, e.g. `models/MAPF/expt1/final/net_checkpoint.pkl`.
  - Run the evaluation script:
    ```shell
    $ python eval_model.py models/MAPF/expt1/final/ -n expt --gpu
    ```
    Notes:
    - The model's directory is passed, not the path to `net_checkpoint.pkl` itself.
    - The argument after `-n` specifies the name of the experiment.
    - The `--gpu` flag enables evaluation on the GPU.
  - Evaluation results are printed in the terminal and uploaded to wandb.
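The command line above could be handled by an `argparse` setup along these lines (a sketch only; the actual option handling in `eval_model.py` may differ):

```python
import argparse

def build_parser():
    # Hypothetical parser mirroring the eval_model.py invocation above.
    parser = argparse.ArgumentParser(description="Evaluate a trained model")
    parser.add_argument("model_dir",
                        help="experiment directory containing net_checkpoint.pkl")
    parser.add_argument("-n", dest="name", help="name of the experiment")
    parser.add_argument("--gpu", action="store_true",
                        help="use the GPU for evaluation")
    return parser
```

For example, `build_parser().parse_args(["models/MAPF/expt1/final/", "-n", "expt", "--gpu"])` yields the directory, experiment name, and GPU flag shown in the command above.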
- `alg_parameters.py` - Training parameters.
- `train_model.py` - Single-model training program; holds the global training network for PPO.
- `multi_train.py` - Multi-model training program; allows setting multiple sets of training parameters.
- `runner.py` - A single process for collecting training data.
- `eval_model.py` - Single-model evaluation program.
- `mapf_gym.py` - Defines the classical reinforcement learning environment for multi-agent pathfinding.
- `episodic_buffer.py` - Defines the episodic buffer used to generate intrinsic rewards.
- `model.py` - Defines the neural-network-based operation model.
- `net.py` - Defines the network architecture.
- Tian Ruen Woon (tianruen)
- Ruibo Zhang (RuiboZhang1)
- Yuen Chung Chan (chan-yc)