Reinforcement Algorithms - Policy Gradient, Q Learning, Double Q Learning, Deep Q Learning and Double Deep Q Learning
-
Updated
Nov 17, 2018 - Python
Reinforcement Algorithms - Policy Gradient, Q Learning, Double Q Learning, Deep Q Learning and Double Deep Q Learning
Reinforcement Learning backend for autonomous driving using Proximal Policy Optimization (PPO)
A Grid Based RL Environment & Implementaions of few Deep-RL Algorithms.
Add a description, image, and links to the rl-algorithms-pytorch topic page so that developers can more easily learn about it.
To associate your repository with the rl-algorithms-pytorch topic, visit your repo's landing page and select "manage topics."