Update README.md

vcadillog · Nov 6, 2019 · 029a72e · 029a72e
1 parent af983d1
commit 029a72e
Showing 1 changed file with 9 additions and 1 deletion.
diff --git a/README.md b/README.md
@@ -35,9 +35,14 @@ The third level of the first world
 3 : SuperMarioBros-2-2-v0
 The second level of the second world
 ```
-## The agent was trained in the first level of Mario, and this is how it learned to finish it.
+The agent was trained in the first level of Mario, and this is how it learned to finish it.
+
 ![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/mario.gif)
 
+Testing in not observed enviroments:
+
+![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/test_2.gif) ![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/test_3.gif) ![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/test_4.gif)
+
 ### This code was inspired from:
 * [1] Proximal Policy Optimization Algorithms. 
 
@@ -56,3 +61,6 @@ The second level of the second world
 
 * [5] The implementation of Super Mario Brothers by "Kautenja".
   https://github.com/Kautenja/gym-super-mario-bros
+
+### To do:
+* Implement meta learning and train in multiple enviroments for a more generalized actor.