Asynchronous Advantage Actor-Critic with Communication in TensorFlow 2

This repository contains the source code used in the paper *Multi-Agent Reinforcement Deep Learning with Emergent Communication*, published at IJCNN'19. The paper describes A3C2, an algorithm for multi-agent learning with communication.

The implementation uses TensorFlow 2.

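The repository contains four environments (Hidden Reward, Navigation, Pursuit, Traffic Intersection) and scripts that launch A3C2 and learn policies. Install the dependencies listed in `requirements.txt` (for example with `pip install -r requirements.txt`), then run the batch scripts at the repository root (`BlindGroupUpBatch.sh`, `NavBatch.sh`, `PursuitBatchDist.sh`, `TrafficBatch.sh`).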

Each agent is defined by three networks.
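
To make the three-network structure concrete, here is a minimal sketch in plain Python rather than TensorFlow. It assumes the three networks are an actor, a critic, and a communication network; this README names only the communication network, and the actor/critic split is inferred from the algorithm's name. The `Agent` class, its method names, the layer sizes, and the single-linear-layer networks are all hypothetical and are not taken from the repository code.

```python
import math
import random


def linear(weights, inputs):
    """Multiply a weight matrix (list of rows) by an input vector."""
    return [sum(w * x for w, x in zip(row, inputs)) for row in weights]


def softmax(logits):
    """Turn logits into a probability distribution (numerically stable)."""
    top = max(logits)
    exps = [math.exp(z - top) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def init_matrix(rows, cols, rng):
    """Small random weights; a stand-in for a real network initializer."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


class Agent:
    """Hypothetical agent holding three single-layer networks."""

    def __init__(self, obs_size, msg_size, n_actions, seed=0):
        rng = random.Random(seed)
        in_size = obs_size + msg_size  # own observation plus received message
        self.actor = init_matrix(n_actions, in_size, rng)  # policy logits
        self.critic = init_matrix(1, in_size, rng)         # state value
        self.comm = init_matrix(msg_size, obs_size, rng)   # outgoing message

    def message(self, obs):
        """Communication network: map an observation to a bounded message."""
        return [math.tanh(z) for z in linear(self.comm, obs)]

    def act_and_value(self, obs, received_msg):
        """Actor and critic both read the observation and the received message."""
        x = list(obs) + list(received_msg)
        policy = softmax(linear(self.actor, x))
        value = linear(self.critic, x)[0]
        return policy, value


# Usage: agent `a` sends a message that agent `b` conditions on.
a = Agent(obs_size=3, msg_size=2, n_actions=4, seed=1)
b = Agent(obs_size=3, msg_size=2, n_actions=4, seed=2)
msg = a.message([0.5, -0.2, 0.1])
policy, value = b.act_and_value([0.0, 1.0, 0.3], msg)
```

In this sketch the message is the only channel between the two agents, which is why the communication network can only be trained through the receiver's loss.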

The algorithm is distributed: multiple workers update the networks asynchronously.
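
The worker pattern can be illustrated with a small, self-contained sketch using Python threads. It is not the repository's training loop: the `GlobalParams` class, the `worker` and `train` functions, and the toy squared-error objective are assumptions made for illustration. Each worker pulls a copy of the shared parameters, computes a gradient locally, and applies it to the shared copy without waiting for the other workers.

```python
import threading


class GlobalParams:
    """Shared parameters that every worker reads from and writes to."""

    def __init__(self, size):
        self.values = [0.0] * size
        self.updates = 0
        self.lock = threading.Lock()

    def snapshot(self):
        """Return a copy of the current parameters."""
        with self.lock:
            return list(self.values)

    def apply_gradients(self, grads, lr):
        """Apply one gradient-descent step to the shared parameters."""
        with self.lock:
            for i, g in enumerate(grads):
                self.values[i] -= lr * g
            self.updates += 1


def worker(params, goal, steps, lr):
    """Pull parameters, compute a local gradient, push it back."""
    for _ in range(steps):
        local = params.snapshot()
        # Gradient of the toy loss sum((p - goal)^2); a real worker would
        # compute actor, critic, and communication gradients from a rollout.
        grads = [2.0 * (p - g) for p, g in zip(local, goal)]
        params.apply_gradients(grads, lr)


def train(n_workers=4, steps=50, lr=0.05):
    params = GlobalParams(size=2)
    goal = [1.0, -2.0]
    threads = [
        threading.Thread(target=worker, args=(params, goal, steps, lr))
        for _ in range(n_workers)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return params, goal


params, goal = train()
```

Because a worker may compute its gradient from a slightly stale snapshot, the updates are not equivalent to sequential gradient descent; the sketch relies on a small learning rate to tolerate that.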

Gradients are pushed across multiple time-steps to optimize the communication network and enforce communication.
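
A scalar toy example may help show what pushing a gradient across time-steps means. It assumes that a message produced by the sender at step t is consumed by the receiver's actor at step t+1, and that the receiver's loss is differentiated with respect to the sender's communication weight. The README does not spell out these details, so the functions, the single-weight "networks", and the squared-error loss below are all hypothetical.

```python
def receiver_loss(w_comm, w_actor, obs_sender, target):
    """Loss at step t+1 as a function of the sender's weight at step t."""
    message = w_comm * obs_sender   # sender's communication network, step t
    action = w_actor * message      # receiver's actor, step t+1
    return (action - target) ** 2


def comm_gradient(w_comm, w_actor, obs_sender, target):
    """Chain rule from the receiver's loss back to the sender's weight."""
    message = w_comm * obs_sender
    action = w_actor * message
    dloss_daction = 2.0 * (action - target)
    daction_dmessage = w_actor      # gradient crosses the time-step boundary here
    dmessage_dwcomm = obs_sender
    return dloss_daction * daction_dmessage * dmessage_dwcomm


def numeric_gradient(w_comm, w_actor, obs_sender, target, eps=1e-6):
    """Central finite difference, used only to check the analytic gradient."""
    hi = receiver_loss(w_comm + eps, w_actor, obs_sender, target)
    lo = receiver_loss(w_comm - eps, w_actor, obs_sender, target)
    return (hi - lo) / (2.0 * eps)


w_comm, w_actor, obs_sender, target = 0.5, -1.5, 2.0, 1.0
grad = comm_gradient(w_comm, w_actor, obs_sender, target)
check = numeric_gradient(w_comm, w_actor, obs_sender, target)

# One descent step on the sender's weight lowers the receiver's loss.
new_w_comm = w_comm - 0.01 * grad
```

The sender's weight is updated purely from the receiver's loss, so the message only improves if it carries information the receiver can use.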
