Update README.md

vcadillog · Nov 6, 2019 · 4d776f8
1 parent 302191f
Showing 1 changed file with 4 additions and 3 deletions.
diff --git a/README.md b/README.md
@@ -58,6 +58,7 @@ Testing in not observed enviroments:
 ![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/test_2.gif) ![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/test_3.gif) ![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/test_4.gif)
 
 ### This code was inspired from:
+
 * [1] Proximal Policy Optimization Algorithms. 
 
   https://arxiv.org/pdf/1707.06347.pdf
@@ -66,21 +67,21 @@ Testing in not observed enviroments:
 
   https://arxiv.org/pdf/1804.03720.pdf
 
-* [3] The implementation in tensorflow 1 of "coreystaten".
+* [3] The implementation of Ping Pong - Atari in tensorflow 1 of "coreystaten".
 
   https://github.com/coreystaten/deeprl-ppo
 
 * [4] Some of parameters of the convolutional neural network of "jakegrigsby".
 
   https://github.com/jakegrigsby/supersonic/tree/master/supersonic
 
-* [5] OpenAI Baselines of Atari and Retro wrappers.
+* [5] OpenAI Baselines of Atari and Retro wrappers for pre processing.
 
   https://github.com/openai/baselines/tree/master/baselines
 
 * [6] The implementation of Super Mario Brothers by "Kautenja".
 
   https://github.com/Kautenja/gym-super-mario-bros
 
-### To do:
+### What to do now?
 * Implement meta learning and train in multiple enviroments for a more generalized actor.