This project provides PyTorch implementations of several MARL algorithms, currently including VDN, QMIX, QTRAN, Qatten, Weighted QMIX (CW-QMIX, OW-QMIX), and QPLEX.
Building on existing implementations (PyMARL, StarCraft, QPLEX), the project simplifies the modules and algorithm flow, improves visualization, and makes it easy to build your own algorithm library.
The hyper-parameters of these algorithms are the same as those in SMAC, as given in their source code.
The project has been tested on StarCraft II, but it can be conveniently applied to any SMAC-like environment.
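The core difference between the additive and monotonic value-decomposition methods listed above can be illustrated in a few lines of plain Python (a toy sketch, not this repository's implementation): VDN sums the per-agent utilities, while QMIX-style mixing applies absolute (non-negative) weights so that the joint value is monotonic in each agent's value and the joint argmax factorizes per agent.

```python
def vdn_mix(agent_qs):
    """VDN: the joint Q-value is simply the sum of per-agent Q-values."""
    return sum(agent_qs)

def qmix_mix(agent_qs, weights, bias):
    """QMIX-style monotonic mixing (toy, one linear layer):
    taking |w| keeps dQ_tot/dQ_i >= 0 for every agent i."""
    return sum(abs(w) * q for w, q in zip(weights, agent_qs)) + bias

agent_qs = [1.0, 2.0, 3.0]
print(vdn_mix(agent_qs))                              # 6.0
print(qmix_mix(agent_qs, [-0.5, 1.0, 2.0], 0.5))      # |−0.5|·1 + 1·2 + 2·3 + 0.5 = 9.0
```

In the real QMIX, the weights and bias are produced by hypernetworks conditioned on the global state; the sketch above only shows the monotonicity constraint.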
`./envs` includes some didactic environments used in QPLEX and an extra multi-goal environment: go_orderly. go_orderly generates a set of goals at the beginning of each episode, and agents are rewarded when they reach their corresponding goals. It is used for verifying improvements such as Hindsight Experience Replay.
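The go_orderly reward idea can be sketched as follows (the function names and grid layout below are hypothetical illustrations, not the environment's actual API): goals are sampled once per episode, and each agent is rewarded only for reaching its *own* goal.

```python
import random

def sample_goals(n_agents, grid_size, rng=random):
    # Hypothetical: one (x, y) goal per agent, drawn at episode start.
    return [(rng.randrange(grid_size), rng.randrange(grid_size))
            for _ in range(n_agents)]

def step_reward(positions, goals):
    # Each agent earns +1 only when it stands on its own assigned goal,
    # which is what makes the task "orderly".
    return sum(1.0 for pos, goal in zip(positions, goals) if pos == goal)

goals = [(0, 0), (2, 3)]
print(step_reward([(0, 0), (1, 1)], goals))  # 1.0: only agent 0 reached its goal
```

Because the goal set is known at episode start, failed episodes can be relabeled with the goals actually reached, which is exactly the kind of improvement Hindsight Experience Replay tests.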
- Python >= 3.6
- PyTorch >= 1.2
- SMAC
- Seaborn >= 0.9
For SMAC, here is a brief introduction to the installation under Linux; for Windows and other systems, refer to the SMAC repository.

- Run the following command to install SMAC:

  ```
  pip install git+https://github.com/oxwhirl/smac.git
  ```

- For the StarCraft II installation, here is the link to version 4.6.2, which is used by SMAC; the SMAC authors note that performance is not always comparable between versions. Other versions can be found in Blizzard's repository. The password `iagreetotheeula` is required for decompression. The default path is `~/StarCraftII/`; you can specify another path by modifying `SC2PATH`.

- Download the SMAC maps and move them to `$SC2PATH/Maps` after decompression.

- Run `python -m smac.bin.map_list` to see the list of SMAC maps. Run `python -m smac.examples.random_agents` to make sure that SMAC and its maps are properly installed.
Run the following command to run a QMIX experiment on 3s5z_vs_3s6z:
$ python -u main.py --map='3s5z_vs_3s6z' --alg='qmix' --max_steps=2000000 --epsilon_anneal_steps=50000 --num=5 --gpu='0'
Or open the project by IDE and run main.py.
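The `--epsilon_anneal_steps` flag in the command above suggests a linearly annealed exploration schedule. A minimal sketch of such a schedule (the start/end values 1.0 and 0.05 are assumptions common in PyMARL-style code, not necessarily this project's defaults):

```python
def linear_epsilon(step, anneal_steps=50000, eps_start=1.0, eps_end=0.05):
    # Linearly decay exploration from eps_start to eps_end over
    # anneal_steps environment steps, then hold at eps_end.
    frac = min(step / anneal_steps, 1.0)
    return eps_start + frac * (eps_end - eps_start)

print(linear_epsilon(0))        # 1.0
print(linear_epsilon(25000))    # halfway through the anneal: ~0.525
print(linear_epsilon(100000))   # past the anneal: held at ~0.05
```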
Run `run.sh` to reproduce the experiments on go_orderly.
You can find the description of the SMAC maps at https://github.com/oxwhirl/smac/blob/master/docs/smac.md.
The experiment settings of all maps are the same as in SMAC, and the difficulty is 7 (VeryHard).
Some pictures in `./imgs` are results from a former version of the project.

(Result figures: Returns | Win Rates)
If you want to watch a replay, make sure `replay_dir` is an absolute path; it can be set in `./common/arguments.py`. The replays of each evaluation will then be saved, and you can find them under that path.