forked from NeuronDance/DeepRL
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathNick_Sun
55 lines (49 loc) · 7.79 KB
/
Nick_Sun
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
Book
=====
Shoham, Yoav, and Kevin Leyton-Brown. Multiagent systems: Algorithmic, game-theoretic, and logical foundations. Cambridge University Press, 2008. http://www.masfoundations.org/download.html
Overview
=====
Buşoniu, Lucian, Robert Babuška, and Bart De Schutter. "Multi-agent reinforcement learning: An overview." Innovations in multi-agent systems and applications-1. Springer, Berlin, Heidelberg, 2010. 183-221. http://www.dcsc.tudelft.nl/~bdeschutter/pub/rep/10_003.pdf
(Code: http://busoniu.net/repository.php)
Dobrev, Dimiter. "The Definition of AI in Terms of Multi Agent Systems." arXiv preprint arXiv:1210.0887 (2012). https://arxiv.org/ftp/arxiv/papers/1210/1210.0887.pdf
Multiagent-Reinforcement-Learning (ppt), 2013. : http://www.ecmlpkdd2013.org/wp-content/uploads/2013/09/Multiagent-Reinforcement-Learning.pdf
Kapoor, Sanyam. "Multi-agent reinforcement learning: A report on challenges and approaches." arXiv preprint arXiv:1807.09427 (2018). https://arxiv.org/abs/1807.09427
Algorithm
=====
(JAL) Claus, Caroline, and Craig Boutilier. "The dynamics of reinforcement learning in cooperative multiagent systems." AAAI/IAAI 1998.746-752 (1998): 2. https://www.aaai.org/Papers/AAAI/1998/AAAI98-106.pdf
(Distributed Q-learning) Lauer, Martin, and Martin Riedmiller. "An algorithm for distributed reinforcement learning in cooperative multi-agent systems." In Proceedings of the Seventeenth International Conference on Machine Learning. 2000. https://www.researchgate.net/publication/2641625_An_Algorithm_for_Distributed_Reinforcement_Learning_in_Cooperative_Multi-Agent_Systems
(team Q-learning) Littman, Michael L. "Value-function reinforcement learning in Markov games." Cognitive Systems Research 2.1 (2001): 55-66. http://www.sts.rpi.edu/~rsun/si-mal/article3.pdf
(FMQ) Kapetanakis, Spiros, and Daniel Kudenko. "Reinforcement learning of coordination in cooperative multi-agent systems." AAAI/IAAI 2002 (2002): 326-331. https://www.aaai.org/Papers/AAAI/2002/AAAI02-050.pdf
(OAL) Wang, Xiaofeng, and Tuomas Sandholm. "Reinforcement learning to play an optimal Nash equilibrium in team Markov games." Advances in neural information processing systems. 2003. https://papers.nips.cc/paper/2171-reinforcement-learning-to-play-an-optimal-nash-equilibrium-in-team-markov-games.pdf
Qi, Dehu, and Ron Sun. "A multi-agent system integrating reinforcement learning, bidding and genetic algorithms." Web Intelligence and Agent Systems: An International Journal 1.3, 4 (2003): 187-202. https://pdfs.semanticscholar.org/2cb8/885ea3d8d6bccde87153f18f8be7f23ff935.pdf
(TEAM-Q) Wang, Ying, and Clarence W. De Silva. "Multi-robot box-pushing: Single-agent q-learning vs. team q-learning." 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE, 2006. https://ieeexplore.ieee.org/document/4058979
(FMQ) Matignon, Laëtitia, Guillaume Laurent, and Nadine Le Fort-Piat. "A study of FMQ heuristic in cooperative multi-agent games." 2008. https://www.researchgate.net/publication/29616600_A_study_of_FMQ_heuristic_in_cooperative_multi-agent_games
(MADDPG) Lowe, Ryan, et al. "Multi-agent actor-critic for mixed cooperative-competitive environments." Advances in Neural Information Processing Systems. 2017. https://arxiv.org/abs/1706.02275
Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients." Thirty-Second AAAI Conference on Artificial Intelligence. 2018. https://arxiv.org/abs/1705.08926
Cooperation
=====
Kok, Jelle R., and Nikos Vlassis. "Sparse cooperative Q-learning." Proceedings of the twenty-first international conference on Machine learning. ACM, 2004. https://icml.cc/Conferences/2004/proceedings/papers/267.pdf
Crandall, Jacob W., and Michael A. Goodrich. "Learning to compete, compromise, and cooperate in repeated general-sum games." Proceedings of the 22nd international conference on Machine learning. ACM, 2005. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.448.8292&rep=rep1&type=pdf
Panait, Liviu, and Sean Luke. "Cooperative multi-agent learning: The state of the art." Autonomous agents and multi-agent systems 11.3 (2005): 387-434. https://cs.gmu.edu/~eclab/papers/panait05cooperative.pdf
Kok, Jelle R., and Nikos Vlassis. "Collaborative multiagent reinforcement learning by payoff propagation." Journal of Machine Learning Research 7.Sep (2006): 1789-1828. http://www.jmlr.org/papers/volume7/kok06a/kok06a.pdf
De Cote, Enrique Munoz, Alessandro Lazaric, and Marcello Restelli. "Learning to cooperate in multi-agent social dilemmas." AAMAS. Vol. 6. 2006. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.107.335&rep=rep1&type=pdf
Ma, Jie, and Stephen Cameron. "Combining policy search with planning in multi-agent cooperation." Robot Soccer World Cup. Springer, Berlin, Heidelberg, 2008. https://www.researchgate.net/publication/220797588_Combining_Policy_Search_with_Planning_in_Multi-agent_Cooperation
Tampuu, Ardi, et al. "Multiagent cooperation and competition with deep reinforcement learning." PloS one 12.4 (2017): e0172395. https://arxiv.org/abs/1511.08779
Coordination
=====
Kapetanakis, Spiros, and Daniel Kudenko. "Reinforcement learning of coordination in cooperative multi-agent systems." AAAI/IAAI 2002 (2002): 326-331. https://www.aaai.org/Papers/AAAI/2002/AAAI02-050.pdf
Lau, Qiangfeng Peter, Mong Li Lee, and Wynne Hsu. "Coordination guided reinforcement learning." Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems-Volume 1. International Foundation for Autonomous Agents and Multiagent Systems, 2012. http://www.ifaamas.org/Proceedings/aamas2012/papers/1B_1.pdf
Zhang, Chongjie, and Victor Lesser. "Coordinating multi-agent reinforcement learning with limited communication." Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems. International Foundation for Autonomous Agents and Multiagent Systems, 2013. https://pdfs.semanticscholar.org/5e7b/0822821575555e318845531b6d5b2d359b18.pdf
Hao, Jianye, et al. "Reinforcement social learning of coordination in networked cooperative multiagent systems." Workshops at the Twenty-Eighth AAAI Conference on Artificial Intelligence. 2014. http://mipc.inf.ed.ac.uk/2014/papers/mipc2014_hao_etal.pdf
Le, Hoang M., et al. "Coordinated multi-agent imitation learning." Proceedings of the 34th International Conference on Machine Learning-Volume 70. JMLR. org, 2017. https://arxiv.org/abs/1703.03121
Khadka, Shauharda, Somdeb Majumdar, and Kagan Tumer. "Evolutionary Reinforcement Learning for Sample-Efficient Multiagent Coordination." arXiv preprint arXiv:1906.07315 (2019). https://arxiv.org/abs/1906.07315
Communicate
=====
Varshavskaya, Paulina, Leslie Pack Kaelbling, and Daniela Rus. "Efficient distributed reinforcement learning through agreement." Distributed Autonomous Robotic Systems 8. Springer, Berlin, Heidelberg, 2009. 367-378. https://www.researchgate.net/publication/241128592_Efficient_Distributed_Reinforcement_Learning_Through_Agreement
Hausknecht, Matthew John. Cooperation and communication in multiagent deep reinforcement learning. Diss. 2016. http://www.cs.utexas.edu/~larg/hausknecht_thesis/slides/thesis.pdf
Sukhbaatar, Sainbayar, and Rob Fergus. "Learning multiagent communication with backpropagation." Advances in Neural Information Processing Systems. 2016. https://arxiv.org/abs/1605.07736
Foerster, Jakob, et al. "Learning to communicate with deep multi-agent reinforcement learning." Advances in Neural Information Processing Systems. 2016. https://arxiv.org/abs/1605.06676
Application
=====
Zheng, Lianmin, et al. "MAgent: A many-agent reinforcement learning platform for artificial collective intelligence." Thirty-Second AAAI Conference on Artificial Intelligence. 2018. https://arxiv.org/abs/1712.00600
Shalev-Shwartz, Shai, Shaked Shammah, and Amnon Shashua. "Safe, multi-agent, reinforcement learning for autonomous driving." arXiv preprint arXiv:1610.03295 (2016). https://arxiv.org/abs/1610.03295