PyTorch Pommerman

This is a PyTorch starting point for experimenting with ideas for the Pommerman competitions (https://www.pommerman.com/)

The reinforcement learning codebase is based upon Ilya Kostrikov's awesome work (https://github.com/ikostrikov/pytorch-a2c-ppo-acktr)

It requires the Pommerman playground (https://github.com/MultiAgentLearning/playground) to be installed in your Python environment, in addition to any dependencies of pytorch-a2c-ppo-acktr.

UPDATE:

# install depends of playgroud project
cd playground
conda env create -f env.yml

# install extra depends of pytorch-pommerman-rl
cd pytorch-pommerman-rl
conda activate pommerman
conda env update --file env.yml  --prune

# run the training command
python main.py --use-gae --env-name PommeFFAPartialFast-v0 --no-norm --seed 42 --algo a2c --lr-schedule 250000000 --num-steps 1000 --num-frames 5e8 --num-stack 5

Usage

With the spatial feature representation and CNN based models, I've been able to train an agent FFA play that does quite well (> 95% win rate). Without reward shaping, it does not learn to bomb, but it does a great job of evading and letting the other agents blow themselves up.

python main.py --use-gae --env-name PommeFFACompetitionFast-v0 --no-norm --seed 42 --algo a2c --lr-schedule 25000000

Below is a training curve for above command. Note that it shows the training reward (non-deterministic), not evaluation which is higher.

Name		Name	Last commit message	Last commit date
Latest commit History 183 Commits
agents		agents
algo		algo
envs		envs
helpers		helpers
imgs		imgs
models		models
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
arguments.py		arguments.py
distributions.py		distributions.py
enjoy.py		enjoy.py
env.yml		env.yml
main.py		main.py
packages_used.txt		packages_used.txt
packages_used_specs.txt		packages_used_specs.txt
replay_storage.py		replay_storage.py
requirements.txt		requirements.txt
rollout_storage.py		rollout_storage.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PyTorch Pommerman

Usage

About

Releases

Packages

Languages

License

Seubill/pytorch-pommerman-rl

Folders and files

Latest commit

History

Repository files navigation

PyTorch Pommerman

Usage

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages