Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.
This work focus on Multi-Agent Cooperation Problem. We proposed a method which consists 3 components:
- Related research - MADDPG This algorithm comes from Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
- Prioritized Batch Data To optimize one-step update without losing diversity, we divide batch data into several parts and prioritize these batches. Using the batch data with maximal loss to do one-step update.
- Distributed Multi-Agent Architecture Similar to A3C algorithm, we adopt this Master and Multi-Worker architecture in our work.
- Keras 2.1.2 (tensorflow 1.4 as backend)
- mpi4py
- Python 3.6
- CUDA 8.0 + cuDNN 6.0
- Modified original environment (you can find in my repo) from OpenAI
- Fixed landmark
- Border
- DDPG & MADDPG & PROPOSED
For program using MPI:
- mpiexec -np [worker_number] python mpi-xxx.py
mpiexec -np 4 python mpirun_main.py
For others:
python xxx.py