Examples of methods for solving multi-armed bandit problems

scascino10/multi_armed_bandits

classic_examples/ contains reproductions of the multi-armed bandit examples from "Reinforcement Learning: An Introduction" by Sutton and Barto (Section 2.3).
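For orientation, here is a minimal sketch of the kind of experiment that section of the book describes: an epsilon-greedy agent with sample-average estimates on a 10-armed Gaussian testbed. It is not the code in classic_examples/. The function name `run_bandit` and its parameters are hypothetical, and it uses only the Python standard library.

```python
import random


def run_bandit(epsilon, n_arms=10, steps=1000, seed=0):
    """Run one epsilon-greedy agent on one randomly drawn bandit task.

    Returns (average reward per step, fraction of steps on the best arm).
    Hypothetical helper for illustration only.
    """
    rng = random.Random(seed)
    # True action values q*(a), drawn from a standard normal distribution.
    q_true = [rng.gauss(0.0, 1.0) for _ in range(n_arms)]
    q_est = [0.0] * n_arms   # sample-average estimates Q(a)
    counts = [0] * n_arms    # number of pulls per arm
    best_arm = max(range(n_arms), key=lambda a: q_true[a])

    total_reward = 0.0
    optimal_pulls = 0
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)  # explore: uniform random arm
        else:
            top = max(q_est)             # exploit: greedy, ties broken at random
            arm = rng.choice([a for a in range(n_arms) if q_est[a] == top])
        # Reward is the true value plus unit-variance Gaussian noise.
        reward = rng.gauss(q_true[arm], 1.0)
        counts[arm] += 1
        # Incremental sample-average update.
        q_est[arm] += (reward - q_est[arm]) / counts[arm]
        total_reward += reward
        optimal_pulls += int(arm == best_arm)
    return total_reward / steps, optimal_pulls / steps


if __name__ == "__main__":
    # Average over independent tasks to smooth out per-task noise.
    for eps in (0.0, 0.01, 0.1):
        runs = [run_bandit(eps, seed=s) for s in range(200)]
        mean_reward = sum(r for r, _ in runs) / len(runs)
        mean_optimal = sum(o for _, o in runs) / len(runs)
        print(f"epsilon={eps}: reward={mean_reward:.3f}, optimal={mean_optimal:.3f}")
```

A single run is noisy, so comparisons between epsilon values are only meaningful after averaging over many independently drawn tasks, as the `__main__` block does.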

gumbel/ contains an implementation of Algorithm 2 from the paper "Policy Improvement by Planning with Gumbel".
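As a rough illustration of the idea, and assuming Algorithm 2 is the paper's Sequential Halving with Gumbel procedure, the sketch below applies it to a plain bandit: sample Gumbel noise, keep the m arms with the largest g(a) + logits(a), then repeatedly halve the candidate set using g(a) + logits(a) + sigma(q_hat(a)). This is not the code in gumbel/. The names `gumbel_sequential_halving`, `pull`, and `sigma_scale` are hypothetical, the linear sigma is a simplification of the paper's transformation, and the even budget split per phase is an assumption.

```python
import math
import random


def sample_gumbel(rng):
    """Draw from a standard Gumbel(0, 1) by inverting the CDF."""
    u = max(rng.random(), 1e-12)  # clamp away from 0 to avoid log(0)
    return -math.log(-math.log(u))


def gumbel_sequential_halving(logits, pull, m, n, sigma_scale=1.0, seed=0):
    """Pick one arm using Gumbel noise plus Sequential Halving (sketch).

    logits: prior preference per arm.
    pull(arm, rng): hypothetical callback returning one sampled reward.
    m: number of arms kept after the Gumbel top-m step.
    n: total simulation budget, split evenly across phases (assumption).
    """
    rng = random.Random(seed)
    k = len(logits)
    g = [sample_gumbel(rng) for _ in range(k)]
    # Gumbel-Top-k trick: the m largest g(a) + logits(a) form a sample
    # without replacement from softmax(logits).
    candidates = sorted(range(k), key=lambda a: g[a] + logits[a], reverse=True)[:m]

    sums = [0.0] * k
    counts = [0] * k
    phases = max(1, math.ceil(math.log2(m)))
    for _ in range(phases):
        if len(candidates) == 1:
            break
        per_arm = max(1, n // (phases * len(candidates)))
        for a in candidates:
            for _ in range(per_arm):
                sums[a] += pull(a, rng)
                counts[a] += 1

        def score(a):
            # sigma is taken to be linear here; the paper uses its own
            # monotonically increasing transformation of q_hat.
            return g[a] + logits[a] + sigma_scale * sums[a] / counts[a]

        keep = (len(candidates) + 1) // 2  # keep the better half, rounded up
        candidates = sorted(candidates, key=score, reverse=True)[:keep]
    return candidates[0]


if __name__ == "__main__":
    means = [0.1, 0.5, 0.9, 0.3]
    chosen = gumbel_sequential_halving(
        logits=[0.0] * len(means),
        pull=lambda a, rng: rng.gauss(means[a], 1.0),
        m=4,
        n=200,
    )
    print("chosen arm:", chosen)
```

Because the same Gumbel sample is reused in every phase, the final choice stays tied to the initial sampling from the prior, while the empirical means only shift it toward arms that look better under simulation.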
