Records on previously trained policies
Use to find optimized hyperparameters for RL agent
Sample output of 50 Optuna trials in Hyperparameter_Optimizer.ipynb
Use to train policy network to play Ms. Pac-Man
Sample output of 100 training episodes in MS_Pacman_PG.ipynb
Policy network object of RL agent trained on Ms. Pac-Man
Optuna study data after final changes to rewards system
Functions for visual representation of training/testing data
Policy network based off of Github user ritesh-kanchi
This code was not used later on for this project.
Visualization of training progress for our final version of policy network
Trained policy network objects
Created during development of improved policy
Not final versions of policy
Final version of policy trained on 10000 episodes
Gameplay renderings of different trained policies
Gameplay of policy network from Hugging Face Deep Reinforcement Learning course with minimal changes and 10000 episodes of training
Created during development of improved policy
Not gameplay from final versions of policy
Best and worst gameplay of final trained policy over 1000 episodes
Scores of each episode during training sessions
Training of final version of policy network over 10000 episodes
Training of initial version of policy network from Hugging Face Deep Reinforcement Learning course with minimal changes over 10000 episodes
Optuna study data for finding the best hyperparameters for training Ms. Pac-Man RL agent
Study ran after making final changes to reward system