Reinforcement-Learning

This repository contains the lab and homework assignments for the Reinforcement Learning course offered in the M.Sc. Artificial Intelligence programme at the University of Amsterdam.

Labs

Lab 1 - Dynamic Programming, Policy Evaluation, Policy Iteration, Value Iteration, MC Control, TD Learning

In this lab we get familiar with the basic concepts of Dynamic Programming and use them to implement Policy Evaluation, Policy Iteration and Value Iteration for GridWorldEnv. We also implement Monte Carlo prediction and Monte Carlo control with an epsilon-greedy policy on BlackjackEnv, and explore Temporal Difference learning.
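As a rough illustration of Value Iteration, here is a minimal sketch on a toy grid. It does not use the lab's GridWorldEnv: the 4x4 layout, the two terminal corners, the reward of -1 per move, and the helper names `step` and `value_iteration` are all assumptions made for this example.

```python
N = 4                                          # assumed 4x4 grid, states 0..15
TERMINALS = {0, N * N - 1}                     # assumed terminal corner states
ACTIONS = [(-1, 0), (0, 1), (1, 0), (0, -1)]   # up, right, down, left


def step(state, action):
    """Hypothetical deterministic transition: returns (next_state, reward)."""
    if state in TERMINALS:
        return state, 0.0
    row, col = divmod(state, N)
    d_row, d_col = ACTIONS[action]
    # Moving into a wall leaves the agent where it is.
    new_row = min(max(row + d_row, 0), N - 1)
    new_col = min(max(col + d_col, 0), N - 1)
    return new_row * N + new_col, -1.0


def value_iteration(gamma=1.0, theta=1e-8):
    """Sweep all states with the Bellman optimality backup until the
    largest change in a sweep falls below theta."""
    V = [0.0] * (N * N)
    while True:
        delta = 0.0
        for s in range(N * N):
            if s in TERMINALS:
                continue
            # One-step lookahead over every action.
            q_values = []
            for a in range(len(ACTIONS)):
                s_next, reward = step(s, a)
                q_values.append(reward + gamma * V[s_next])
            best = max(q_values)
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < theta:
            return V


V = value_iteration()
```

Under these assumptions each state's value is the negative number of moves to the nearest terminal corner, which makes the result easy to check by hand.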

Approximate state-value functions for the blackjack policy that sticks only on 20 or 21, computed by Monte Carlo policy evaluation.

The optimal policy and state-value function for blackjack, found by Monte Carlo control with an epsilon-greedy policy.
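The epsilon-greedy policy used in Monte Carlo control can be sketched as follows. This is an illustrative version only, not the notebook's code: the function names `epsilon_greedy_probs` and `sample_action` and the default `epsilon=0.1` are assumptions.

```python
import random


def epsilon_greedy_probs(q_values, epsilon=0.1):
    """Action probabilities: every action gets epsilon / n_actions,
    and the greedy action gets the remaining 1 - epsilon on top."""
    n_actions = len(q_values)
    greedy = max(range(n_actions), key=lambda a: q_values[a])
    probs = [epsilon / n_actions] * n_actions
    probs[greedy] += 1.0 - epsilon
    return probs


def sample_action(q_values, epsilon=0.1, rng=random):
    """Draw one action index according to the epsilon-greedy probabilities."""
    probs = epsilon_greedy_probs(q_values, epsilon)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]
```

For two actions with estimated values `[0.2, 1.0]` and `epsilon=0.1`, the second action is chosen with probability of about 0.95 and the first with about 0.05, so the agent keeps exploring while mostly following its current estimates.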

Problems and Solutions

Problems and solutions can be found in the ipynb files here.

Lab 2 - Deep-Q Networks, Monte Carlo Reinforce, Actor-Critic Methods

Problems and Solutions

Problems and solutions can be found in the ipynb files here.

Homeworks

The homework was done in collaboration with Dhruba Pujary. Problems and solutions for the homework assignments can be found here.

KrishnaTarun/Reinforcement-Learning