RL implementation on the gym-minigird environment Reference: https://github.com/lcswillems/pytorch-a2c-ppo https://github.com/openai/baselines https://github.com/maximecb/gym-minigrid