A modular implementation of Proximal Policy Optimization in TensorFlow 2 using eager execution, for the Super Mario Bros environment.
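For context, the heart of PPO is the clipped surrogate objective of [1]. The following is a generic TensorFlow 2 sketch of that loss, not the exact code of this repository; the function name and argument shapes are illustrative:
```python
import tensorflow as tf

def ppo_clip_loss(new_log_probs, old_log_probs, advantages, clip_range=0.2):
    """Clipped surrogate objective from [1].

    All arguments are 1-D tensors of shape (batch,); `clip_range` is the
    PPO epsilon. Returns a scalar loss to minimize.
    """
    # Probability ratio r_t = pi_theta(a|s) / pi_theta_old(a|s).
    ratio = tf.exp(new_log_probs - old_log_probs)
    # Unclipped and clipped surrogate terms.
    unclipped = ratio * advantages
    clipped = tf.clip_by_value(ratio, 1.0 - clip_range, 1.0 + clip_range) * advantages
    # PPO maximizes the elementwise minimum of the two terms; negate it
    # so a gradient-descent optimizer can minimize the result.
    return -tf.reduce_mean(tf.minimum(unclipped, clipped))
```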
- TensorFlow 2
- OpenCV (frame preprocessing; see the sketch after this list)
- OpenAI Gym
- gym-super-mario-bros (Super Mario Bros for the NES), developed by Kautenja
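OpenCV is presumably used for the Baselines-style frame preprocessing [5]; a minimal sketch of the usual grayscale-and-resize step (the exact frame size used in this repository may differ):
```python
import cv2
import numpy as np

def preprocess_frame(frame, size=84):
    """Grayscale and resize one RGB frame, Atari-wrapper style [5]."""
    gray = cv2.cvtColor(frame, cv2.COLOR_RGB2GRAY)
    resized = cv2.resize(gray, (size, size), interpolation=cv2.INTER_AREA)
    # Scale to [0, 1] so the network sees small float inputs.
    return resized.astype(np.float32) / 255.0
```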
Clone the repository, then change the working directory to the cloned repository:
```python
import os
os.chdir('./PPO-Mario-Bros-Tensorflow-2')
```
For training, run:
```bash
python -c 'from Main import train; train(True)'
```
The boolean argument of `train` enables loading the weights of a previously trained model.
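As a hypothetical sketch of how such a flag is commonly wired up with `tf.keras` (the names, model, and checkpoint path below are illustrative, not the actual internals of `Main.py`):
```python
import os
import tensorflow as tf

def train(load_pretrained, ckpt_path='./checkpoints/mario'):
    # Tiny stand-in network; the repository builds a convolutional actor-critic.
    model = tf.keras.Sequential([tf.keras.layers.Dense(7, input_shape=(84 * 84,))])
    if load_pretrained and os.path.exists(ckpt_path + '.index'):
        # Resume from previously saved weights instead of starting from scratch.
        model.load_weights(ckpt_path)
    # ... training loop goes here ...
    os.makedirs(os.path.dirname(ckpt_path), exist_ok=True)
    model.save_weights(ckpt_path)
```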
For testing the model, run:
```bash
python -c 'from Main import test; test(10,0)'
```
Here the first argument of `test` is the number of episodes to run, and the second is the index of the environment to test in; the indices map to the following environments:
- `0`: `SuperMarioBros-1-1-v0` (the first level of the first world)
- `1`: `SuperMarioBros-1-2-v0` (the second level of the first world)
- `2`: `SuperMarioBros-1-3-v0` (the third level of the first world)
- `3`: `SuperMarioBros-2-2-v0` (the second level of the second world)
To change the environments, modify the `Enviroments.py` file. Eight actors were trained on the first level of Mario, and this is how it learned to finish it.
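As a hypothetical sketch (the actual contents of `Enviroments.py` may differ), an index-to-environment mapping like the one above can be built with Kautenja's `gym_super_mario_bros` package [6]:
```python
import gym_super_mario_bros
from gym_super_mario_bros.actions import SIMPLE_MOVEMENT
from nes_py.wrappers import JoypadSpace

# Illustrative mapping from test index to Gym environment ID.
ENV_IDS = {
    0: 'SuperMarioBros-1-1-v0',
    1: 'SuperMarioBros-1-2-v0',
    2: 'SuperMarioBros-1-3-v0',
    3: 'SuperMarioBros-2-2-v0',
}

def make_env(index):
    env = gym_super_mario_bros.make(ENV_IDS[index])
    # Restrict the NES controller to a small discrete action set.
    return JoypadSpace(env, SIMPLE_MOVEMENT)
```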
Testing in unseen environments:
References:
- [1] Proximal Policy Optimization Algorithms.
- [2] Gotta Learn Fast: A New Benchmark for Generalization in RL.
- [3] The TensorFlow 1 implementation by "coreystaten".
- [4] Some parameters of the convolutional neural network from "jakegrigsby": https://github.com/jakegrigsby/supersonic/tree/master/supersonic
- [5] The OpenAI Baselines Atari and Retro wrappers.
- [6] The Super Mario Bros implementation by "Kautenja".
To do:
- Implement meta-learning and train in multiple environments for a more generalized actor.