Update README.md

vcadillog · Nov 6, 2019 · 302191f · 302191f
1 parent e71efd2
commit 302191f
Showing 1 changed file with 7 additions and 0 deletions.
diff --git a/README.md b/README.md
@@ -42,10 +42,17 @@ The second level of the second world
 ```
 
 To change the enviroments, modify the Enviroments.py file.
+
 Eight actors were trained in the first level of Mario, and this is how it learned to finish it.
 
 ![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/mario.gif)
 
+A plot how the average reward evolved vs the time steps, the model trained in four steps due connection, the reward isn't the same as the raw output of Kautenja's implementation, it was previously scaled for this model, all the data pre processing is in the Datapreprocessing.py file.
+
+![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/log1.PNG)![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/log2.PNG)![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/log3.PNG)![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/log4.PNG)
+
+In the logs directory you can find two more plots, for average X_position and Max_X_position. 
+
 Testing in not observed enviroments:
 
 ![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/test_2.gif) ![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/test_3.gif) ![alt text](https://github.com/vcadillog/PPO-Mario-Bros-Tensorflow-2/blob/master/images/test_4.gif)