Skip to content
#

policy-improvement

Here are 9 public repositories matching this topic...

Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncated Policy Iteration, Value Iteration . From Udacity's Deep Reinforcement Learning Nanodegree program.

  • Updated Apr 3, 2019
  • Jupyter Notebook

This repository contains two reinforcement learning projects: "Treasure Hunt in the Frozen Lake," which navigates a modified FrozenLake using dynamic programming, and "Optimizing Movie Recommendations," which employs Multi-Armed Bandits to enhance user satisfaction.

  • Updated Feb 27, 2025
  • Jupyter Notebook

A web-based interactive Grid World environment for learning and visualizing reinforcement learning algorithms including policy evaluation, policy improvement, and value iteration. Built with Flask backend implementing RL algorithms and JavaScript frontend for grid visualization.

  • Updated Mar 13, 2025
  • JavaScript

Based on the book --- Reinforcement Learning: An Introduction (2nd ed, 2018) by Sutton and Barto. For the Reinforcement Learning course Assignment 2 (see Gridworld Problem 1.pdf) at Memorial University of Newfoundland, Jul. 18, 2024

  • Updated Aug 12, 2024
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the policy-improvement topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the policy-improvement topic, visit your repo's landing page and select "manage topics."

Learn more