Skip to content
View wenqsun's full-sized avatar
  • The Hong Kong University of Science and Technology
  • 23:39 (UTC +08:00)

Highlights

  • Pro

Block or report wenqsun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userā€™s behavior. Learn more about reporting abuse.

Report abuse
Stars

šŸ¤”Reinforcement learning

This is a list of reinforcement learning resources.
13 repositories

Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)

Python 485 128 Updated Dec 1, 2022

An elegant, flexible, and superfast PyTorch deep reinforcement learning platform.

Python 5 4 Updated Sep 25, 2023

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,882 6,089 Updated Mar 10, 2025

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,766 678 Updated Feb 15, 2025

Implementation of benchmark RL algorithms

Python 466 81 Updated Jul 20, 2022

PyTorch implementation of SAC-Discrete.

Python 298 35 Updated Jul 25, 2024

Deep Reinforcement Learning Lab, a platform designed to make DRL technology and fun for everyone

2,442 588 Updated Apr 11, 2022

A curated list of Diffusion Model in RL resources (continually updated)

1,016 56 Updated Feb 15, 2025

Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"

Python 988 162 Updated Jul 18, 2024

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Python 1,171 190 Updated Feb 9, 2021

Intro to Reinforcement Learning (å¼ŗ化学习ēŗ²č¦ļ¼‰

3,323 495 Updated Jul 25, 2020

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various rā€¦

Python 240 15 Updated Aug 19, 2024

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 6,493 715 Updated Mar 4, 2025