This project compares reinforcement learning algorithms, including Monte Carlo, Q-learning, lambda Q-learning, and epsilon-greedy variations, among others.
python
reinforcement-learning
monte-carlo
openai-gym
q-learning
policy
rl-agents
epsilon-greedy
dynamic-programming
markov-chains
approximation-algorithms
ucb1
q-lambda
exploration-exploitation
thomson-sampling
frozen-lake
multi-bandit-army
Updated Feb 15, 2022 - Python