Skip to content

Latest commit

 

History

History
 
 

DRL-Multi-Agent

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 

Book

[1].Shoham, Yoav, and Kevin Leyton-Brown. Multiagent systems: Algorithmic, game-theoretic, and logical foundations. Cambridge University Press, 2008.

Overview

[1].Buşoniu, Lucian, Robert Babuška, and Bart De Schutter. "Multi-agent reinforcement learning: An overview." Innovations in multi-agent systems and applications-1. Springer, Berlin, Heidelberg, 2010. 183-221. 代码请点击Code

[2].Dobrev, Dimiter. "The Definition of AI in Terms of Multi Agent Systems." arXiv preprint arXiv:1210.0887 (2012). [3].Multiagent-Reinforcement-Learning (ppt), 2013.
[4].Kapoor, Sanyam. "Multi-agent reinforcement learning: A report on challenges and approaches." arXiv preprint arXiv:1807.09427 (2018).

Algorithm

VDN:

Value-Decomposition Networks For Cooperative Multi-Agent Learning AAMAS 2018

**QMIX: **

Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning ICML 2018

JAL:

(JAL) Claus, Caroline, and Craig Boutilier. "The dynamics of reinforcement learning in cooperative multiagent systems." AAAI/IAAI 1998.746-752 (1998): 2.

Distributed Q-learning:

(Distributed Q-learning) Lauer, Martin, and Martin Riedmiller. "An algorithm for distributed reinforcement learning in cooperative multi-agent systems." In Proceedings of the Seventeenth International Conference on Machine Learning. 2000.

Team Q-learning:

(team Q-learning) Littman, Michael L. "Value-function reinforcement learning in Markov games." Cognitive Systems Research 2.1 (2001): 55-66.

FMQ:

(FMQ) Kapetanakis, Spiros, and Daniel Kudenko. "Reinforcement learning of coordination in cooperative multi-agent systems." AAAI/IAAI 2002 (2002): 326-331.

OAL:

(OAL) Wang, Xiaofeng, and Tuomas Sandholm. "Reinforcement learning to play an optimal Nash equilibrium in team Markov games." Advances in neural information processing systems. 2003.

Qi, Dehu, and Ron Sun. "A multi-agent system integrating reinforcement learning, bidding and genetic algorithms." Web Intelligence and Agent Systems: An International Journal 1.3, 4 (2003): 187-202.

TEAM_Q:

(TEAM-Q) Wang, Ying, and Clarence W. De Silva. "Multi-robot box-pushing: Single-agent q-learning vs. team q-learning." 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE, 2006.
FMQ: (FMQ) Matignon, Laëtitia, Guillaume Laurent, and Nadine Le Fort-Piat. "A study of FMQ heuristic in cooperative multi-agent games." 2008.
MADDPG:

(MADDPG) Lowe, Ryan, et al. "Multi-agent actor-critic for mixed cooperative-competitive environments." Advances in Neural Information Processing Systems. 2017.

MARWIL:

Exponentially Weighted Imitation Learning for Batched Historical Data NIPS 2018

Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients." Thirty-Second AAAI Conference on Artificial Intelligence. 2018.

Code

QMIX by Ray framework: https://github.com/ray-project/ray/tree/master/rllib/agents/qmix (also VDN)

MADDPG by Ray framework: https://github.com/ray-project/ray/blob/master/rllib/contrib/maddpg/maddpg.py

MARWIL by Ray framework: https://github.com/ray-project/ray/blob/master/rllib/agents/marwil/marwil.py

Cooperation

[1]. Kok, Jelle R., and Nikos Vlassis. "Sparse cooperative Q-learning." Proceedings of the twenty-first international conference on Machine learning. ACM, 2004.
[2]. Crandall, Jacob W., and Michael A. Goodrich. "Learning to compete, compromise, and cooperate in repeated general-sum games." Proceedings of the 22nd international conference on Machine learning. ACM, 2005.
[3]. Panait, Liviu, and Sean Luke. "Cooperative multi-agent learning: The state of the art." Autonomous agents and multi-agent systems 11.3 (2005): 387-434.
[4]. Kok, Jelle R., and Nikos Vlassis. "Collaborative multiagent reinforcement learning by payoff propagation." Journal of Machine Learning Research 7.Sep (2006): 1789-1828.
[5]. De Cote, Enrique Munoz, Alessandro Lazaric, and Marcello Restelli. "Learning to cooperate in multi-agent social dilemmas." AAMAS. Vol. 6. 2006.
[6]. Ma, Jie, and Stephen Cameron. "Combining policy search with planning in multi-agent cooperation." Robot Soccer World Cup. Springer, Berlin, Heidelberg, 2008.
[7]. Tampuu, Ardi, et al. "Multiagent cooperation and competition with deep reinforcement learning." PloS one 12.4 (2017): e0172395.

Coordination

[1]. Kapetanakis, Spiros, and Daniel Kudenko. "Reinforcement learning of coordination in cooperative multi-agent systems." AAAI/IAAI 2002 (2002): 326-331.
[2]. Lau, Qiangfeng Peter, Mong Li Lee, and Wynne Hsu. "Coordination guided reinforcement learning." Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems-Volume 1. International Foundation for Autonomous Agents and Multiagent Systems, 2012.
[3]. Zhang, Chongjie, and Victor Lesser. "Coordinating multi-agent reinforcement learning with limited communication." Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems. International Foundation for Autonomous Agents and Multiagent Systems, 2013.
[4]. Hao, Jianye, et al. "Reinforcement social learning of coordination in networked cooperative multiagent systems." Workshops at the Twenty-Eighth AAAI Conference on Artificial Intelligence. 2014.
[5]. Le, Hoang M., et al. "Coordinated multi-agent imitation learning." Proceedings of the 34th International Conference on Machine Learning-Volume 70. JMLR. org, 2017.
[6]. Khadka, Shauharda, Somdeb Majumdar, and Kagan Tumer. "Evolutionary Reinforcement Learning for Sample-Efficient Multiagent Coordination." arXiv preprint arXiv:1906.07315 (2019).

Communicate

[1]. Varshavskaya, Paulina, Leslie Pack Kaelbling, and Daniela Rus. "Efficient distributed reinforcement learning through agreement." Distributed Autonomous Robotic Systems 8. Springer, Berlin, Heidelberg, 2009. 367-378.
[2]. Hausknecht, Matthew John. Cooperation and communication in multiagent deep reinforcement learning. Diss. 2016.
[3]. Sukhbaatar, Sainbayar, and Rob Fergus. "Learning multiagent communication with backpropagation." Advances in Neural Information Processing Systems. 2016.
[4]. Foerster, Jakob, et al. "Learning to communicate with deep multi-agent reinforcement learning." Advances in Neural Information Processing Systems. 2016.

Application

[1]. Zheng, Lianmin, et al. "MAgent: A many-agent reinforcement learning platform for artificial collective intelligence." Thirty-Second AAAI Conference on Artificial Intelligence. 2018.
[2].Shalev-Shwartz, Shai, Shaked Shammah, and Amnon Shashua. "Safe, multi-agent, reinforcement learning for autonomous driving." arXiv preprint arXiv:1610.03295 (2016).