- Berkeley
- www.kostrikov.xyz
- @ikostrikov
-
-
-
jaxrl Public template
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
-
-
-
pytorch-a2c-ppo-acktr-gail Public
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…
-
d4rl Public
Forked from Farama-Foundation/D4RLA benchmark for offline reinforcement learning.
Python Apache License 2.0 UpdatedApr 7, 2022 -
gym-wordle Public
Forked from zach-lawless/gym-wordleGym environment for playing Wordle with RL agents
Python UpdatedFeb 8, 2022 -
gym Public
Forked from openai/gymA toolkit for developing and comparing reinforcement learning algorithms.
-
-
-
unitree_sim Public
Forked from vikashplus/unitree_simMuJoCo models for Unitree Robots
1 UpdatedDec 15, 2021 -
-
doodad Public
Forked from rail-berkeley/doodad -
roboverse Public
Forked from avisingh599/roboverseA set of environments utilizing pybullet for simulation of robotic manipulation tasks.
-
Implicit-Q-Learning Public
Forked from BY571/Implicit-Q-LearningPyTorch implementation of the implicit Q-learning algorithm (IQL)
-
motion_imitation Public
Forked from erwincoumans/motion_imitationCode accompanying the paper "Learning Agile Robotic Locomotion Skills by Imitating Animals"
-
oatomobile Public
Forked from OATML/oatomobileA research framework for autonomous driving
Python Apache License 2.0 UpdatedJul 13, 2021 -
pytorch-flows Public
PyTorch implementations of algorithms for density estimation
-
SMAAC Public
Forked from sunghoonhong/SMAACThis repo contains the code of "Winning the L2RPN Challenge: Power Grid Management via Semi-Markov Afterstate Actor-Critic".
Python Mozilla Public License 2.0 UpdatedMay 6, 2021 -
pytorch-ddpg-naf Public
Implementation of algorithms for continuous control (DDPG and NAF).
-
-
pytorch-a3c Public
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
-
mazelab Public
Forked from zuoxingdong/mazelabA customizable framework to create maze and gridworld environments
-
pytorch-trpo Public
PyTorch implementation of Trust Region Policy Optimization
-
Mine_tf2.0 Public
Forked from MasanoriYamada/Mine_pytorchMINE: Mutual Information Neural Estimation in pytorch
-
pytorch-rl Public
-
pytorch-meta-optimizer Public
A PyTorch implementation of Learning to learn by gradient descent by gradient descent
-
-
TensorFlow-Pointer-Networks Public
TensorFlow implementation of Pointer Networks