Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
vcadillog authored Nov 6, 2019
1 parent 302191f commit 4d776f8
Showing 1 changed file with 4 additions and 3 deletions.
7 changes: 4 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -58,6 +58,7 @@ Testing in not observed enviroments:
![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/test_2.gif) ![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/test_3.gif) ![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/test_4.gif)

### This code was inspired from:

* [1] Proximal Policy Optimization Algorithms.

https://arxiv.org/pdf/1707.06347.pdf
Expand All @@ -66,21 +67,21 @@ Testing in not observed enviroments:

https://arxiv.org/pdf/1804.03720.pdf

* [3] The implementation in tensorflow 1 of "coreystaten".
* [3] The implementation of Ping Pong - Atari in tensorflow 1 of "coreystaten".

https://github.com/coreystaten/deeprl-ppo

* [4] Some of parameters of the convolutional neural network of "jakegrigsby".

https://github.com/jakegrigsby/supersonic/tree/master/supersonic

* [5] OpenAI Baselines of Atari and Retro wrappers.
* [5] OpenAI Baselines of Atari and Retro wrappers for pre processing.

https://github.com/openai/baselines/tree/master/baselines

* [6] The implementation of Super Mario Brothers by "Kautenja".

https://github.com/Kautenja/gym-super-mario-bros

### To do:
### What to do now?
* Implement meta learning and train in multiple enviroments for a more generalized actor.

0 comments on commit 4d776f8

Please sign in to comment.