Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
vcadillog authored Nov 6, 2019
1 parent af983d1 commit 029a72e
Showing 1 changed file with 9 additions and 1 deletion.
10 changes: 9 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -35,9 +35,14 @@ The third level of the first world
3 : SuperMarioBros-2-2-v0
The second level of the second world
```
## The agent was trained in the first level of Mario, and this is how it learned to finish it.
The agent was trained in the first level of Mario, and this is how it learned to finish it.

![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/mario.gif)

Testing in not observed enviroments:

![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/test_2.gif) ![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/test_3.gif) ![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/test_4.gif)

### This code was inspired from:
* [1] Proximal Policy Optimization Algorithms.

Expand All @@ -56,3 +61,6 @@ The second level of the second world

* [5] The implementation of Super Mario Brothers by "Kautenja".
https://github.com/Kautenja/gym-super-mario-bros

### To do:
* Implement meta learning and train in multiple enviroments for a more generalized actor.

0 comments on commit 029a72e

Please sign in to comment.