In tabular reinforcement learning methods, which broadly include dynamic programming, Monte Carlo methods, and temporal-difference methods (Q-learning, SARSA, etc.), the state values V(s) and state-action values Q(s, a) are stored in tables.
These methods are practical for small, discrete environments, but when the state space is continuous and/or very large, tabular methods become impractical.
Most real-world environments are not discrete, so approximate RL methods were developed to handle these scenarios.
Approximate RL methods use supervised machine learning (both traditional and deep learning) models to approximate the state-action values.
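As a quick illustration of the tabular idea (not code from this repo, and with illustrative names), the Q-table can be kept in a dictionary and updated with the standard Q-learning rule:

```python
# Minimal sketch of tabular Q-learning: Q(s, a) stored in a dictionary.
from collections import defaultdict

Q = defaultdict(float)          # maps (state, action) -> value
alpha, gamma = 0.1, 0.99        # learning rate and discount factor

def q_learning_update(s, a, r, s_next, actions):
    # Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))
    best_next = max(Q[(s_next, a2)] for a2 in actions)
    Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])
```

This only works while the set of states is small enough to enumerate, which is exactly the limitation that motivates function approximation.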
In this repo, linear models (linear regression) are combined with RL methods (Q-learning and Monte Carlo) to approximate the state-action values. Feature engineering with radial basis functions (RBF) allows the linear model to approximate non-linear functions, and the resulting agent is applied to the CartPole environment in OpenAI Gym. A sketch of this approach is shown below.
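The sketch below shows one common way to combine semi-gradient Q-learning with a linear model over RBF features on CartPole. It is an assumption of what such a pipeline looks like, not the repo's actual code: the hyperparameters, the use of `sklearn.kernel_approximation.RBFSampler`, and the training loop are illustrative, and it is written against the classic Gym API (pre-0.26, where `reset()` returns only the observation and `step()` returns four values).

```python
import numpy as np
import gym
from sklearn.kernel_approximation import RBFSampler

env = gym.make("CartPole-v0")

# Collect states from random play to fit the RBF featurizer.
states = []
for _ in range(200):
    s, done = env.reset(), False
    while not done:
        states.append(s)
        s, _, done, _ = env.step(env.action_space.sample())

n_features = 500
featurizer = RBFSampler(gamma=1.0, n_components=n_features)
featurizer.fit(np.array(states))

def phi(s):
    # Map a raw 4-dimensional CartPole state to RBF features.
    return featurizer.transform([s])[0]

n_actions = env.action_space.n
W = np.zeros((n_actions, n_features))   # one linear model per action
alpha, gamma_, eps = 0.01, 0.99, 0.1

def q_values(s):
    return W @ phi(s)                   # Q(s, a) = w_a . phi(s)

for episode in range(300):
    s, done = env.reset(), False
    while not done:
        # Epsilon-greedy action selection.
        if np.random.rand() < eps:
            a = env.action_space.sample()
        else:
            a = int(np.argmax(q_values(s)))
        s2, r, done, _ = env.step(a)
        target = r if done else r + gamma_ * np.max(q_values(s2))
        # Semi-gradient update of the linear weights for the taken action.
        W[a] += alpha * (target - W[a] @ phi(s)) * phi(s)
        s = s2
```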
Batch gradient descent with Monte Carlo returns is also applied to the CartPole environment in this repo.
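A hedged sketch of what "batch gradient descent with Monte Carlo" could look like for linear value prediction: play an episode, compute the discounted returns G_t, then take one batch gradient step on the mean squared error between the linear prediction w·phi(s_t) and G_t. The function and variable names here are illustrative, not the repo's.

```python
import numpy as np

def episode_returns(rewards, gamma=0.99):
    # Compute discounted returns G_t for every step of one episode.
    G, out = 0.0, []
    for r in reversed(rewards):
        G = r + gamma * G
        out.append(G)
    return list(reversed(out))

def batch_gradient_step(w, features, returns, lr=0.01):
    # One batch gradient-descent step on 0.5 * mean_t (w . phi(s_t) - G_t)^2.
    X = np.array(features)          # shape: (T, n_features)
    G = np.array(returns)           # shape: (T,)
    error = X @ w - G
    grad = X.T @ error / len(G)
    return w - lr * grad
```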