Clean and flexible implementation of PPO (built on top of stable-baselines3)
-
Updated
Jul 9, 2021 - Python
Clean and flexible implementation of PPO (built on top of stable-baselines3)
Recurrent Policies for Handling Partially Observable Environments
Simple and Modular implementation of Proximal Policy Optimization (PPO) in PyTorch
An autonomous agent that learns to play Atari Bowling using Reinforcement Learning and Proximal Policy Optimization
You can see a reference for Books, Articles, Courses and Educational Materials in this field. Implementation of Reinforcement Learning Algorithms and Environments. Python, OpenAI Gym, Tensorflow.
Balancing a Ball with Reinforcement Learning
Implement PPO to solve Crawler problem in Unity
An implementation from the state-of-the-art family of reinforcement learning algorithms Proximal Policy Optimization using normalized Generalized Advantage Estimation and optional batch mode training. The loss function incorporates an entropy bonus.
XOXO² - Use Reinforcement Learning to train agent to play U_T-T-T.
A pytorch project to easily run experiments on OpenAI's Procgen Benchmark
Unlocking the Power of Generative AI: In-Context Learning, Instruction Fine-Tuning and Reinforcement Learning Fine-Tuning.
Modular Deep RL infrastructure in PyTorch
This repository contains my assignment solutions for the Deep Reinforcement Learning course (430.729_003) offered by Seoul National University (Spring 2020).
A custom Gym environment for a Rock-Paper-Scissors game, where a reinforcement learning agent and a CNN model are trained, evaluated, and compared using Ray RLlib and TensorFlow.
Cognitive Generative Intelligent Task Offloading for Digital Twins of Vehicular Networks This repository contains the code and resources for the implementation of cognitive generative intelligent task offloading in digital twins for vehicular networks.
Training a PPO to balance a pendulum in a fully observable environment.
Noise-Adaptive Driving Assistance System (NADAS) using Deep Reinforcement Learning, State-Estimation & State Representation
Reinforcement learning agent for playing Flappy Bird, as part of a university project
Evaluating the impact of curriculum learning on the training process for an intelligent agent in a video game
Add a description, image, and links to the proximal-policy-optimization topic page so that developers can more easily learn about it.
To associate your repository with the proximal-policy-optimization topic, visit your repo's landing page and select "manage topics."