Curvature Corrected Moving Average: An accurate and model-free path smoothing algorithm.
-
Updated
Dec 23, 2025 - Python
Curvature Corrected Moving Average: An accurate and model-free path smoothing algorithm.
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
ExORL: Exploratory Data for Offline Reinforcement Learning
Customisable Unified Physical Simulations (CUPS) for Reinforcement Learning. Experiments run on the ai2thor environment (http://ai2thor.allenai.org/) e.g. using A3C, RainbowDQN and A3C_GA (Gated Attention multi-modal fusion) for Task-Oriented Language Grounding (tasks specified by natural language instructions) e.g. "Pick up the Cup or else"
Code release for SceneReplica paper.
Code for IEEE MLSP 2021 paper titled "Model-Free Learning of Optimal Deterministic Resource Allocations in Wireless Systems via Action-Space Exploration"
A Pytorch implementation of The Visual Centrifuge: Model-Free Layered Video Representations.
Deep Reinforcement Learning implementation in Keras of an AI controlling the popular Flappy Bird videogame, using Asynchronous Advantage Actor Critic (A3C)
Network inference via Event Space Linearization (ESL)
Project for the course "Foundations of Reinforcement Learning" 2021 at ETH Zurich
Sample Policy Gradient
A Liquid RL framework for Autonomous Cyber Defence
A compact PyTorch implementation of the REINFORCE algorithm enhanced with a learned baseline (value network).This repository trains a policy network on OpenAI Gym's CartPole-v1 environment using Monte Carlo returns and a value function baseline to reduce variance.
Add a description, image, and links to the model-free topic page so that developers can more easily learn about it.
To associate your repository with the model-free topic, visit your repo's landing page and select "manage topics."