Pinned Loading
-
rl-book-challenge
rl-book-challenge Publicself-studying the Sutton & Barto the hard way
-
two-step-task
two-step-task PublicImplementation of the two-step-task as described in "Prefrontal cortex as a meta-reinforcement learning system" and "Learning to Reinforcement Learn".
-
spinning-up-a-Pong-AI-with-deep-RL
spinning-up-a-Pong-AI-with-deep-RL PublicCode for "Spinning Up a Pong AI With Deep RL" on FloydHub.
-
gym-alttp-gridworld
gym-alttp-gridworld PublicA gym environment for Stuart Armstrong's model of a treacherous turn.
JavaScript 17
-
quantilizers
quantilizers PublicCode from "How useful is quantilization for mitigating specification-gaming?"
Python 3
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.