Skip to content
View anair13's full-sized avatar

Block or report anair13

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A list of Offline to Online RL papers (continually updated)

38 Updated Sep 9, 2024

SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning

Python 478 53 Updated Dec 6, 2024

hand importer and dataset creation tool for poker

Python 1 Updated Jan 28, 2024

The official Python library for the OpenAI API

Python 24,986 3,651 Updated Mar 9, 2025

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 102,857 16,660 Updated Mar 10, 2025

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,168 143 Updated Aug 3, 2023
Python 14 1 Updated Mar 8, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 77,851 9,329 Updated Jan 4, 2025

Example notebooks for Reverse Engineering the Neural Tangent Kernel

Jupyter Notebook 9 Updated Jun 17, 2022

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Jupyter Notebook 10,660 1,385 Updated Nov 4, 2024

Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)

310 17 Updated Nov 21, 2022

🛠️ Corrected Test Sets for ImageNet, MNIST, CIFAR, Caltech-256, QuickDraw, IMDB, Amazon Reviews, 20News, and AudioSet

182 9 Updated Jan 6, 2023

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 6,493 715 Updated Mar 4, 2025

PyTorch dataset extended with map, cache etc. (tensorflow.data like)

Python 328 18 Updated Jun 13, 2022

Understanding Deep Networks via Extremal Perturbations and Smooth Masks

Python 345 32 Updated Jul 22, 2020

CNN for rope state estimation

Python 2 Updated Jan 17, 2020

Fast symbolic computation, code generation, and nonlinear optimization for robotics

C++ 1,478 151 Updated Mar 7, 2025

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,704 173 Updated Mar 6, 2025

Image Visualization Tools (object detection, semantic and instance segmentation)

Python 252 30 Updated Nov 22, 2024

Learning to Fill the Seam by Vision: Sub-millimeter Peg-in-hole on Unseen Shapes in Real World

Python 33 5 Updated Jul 24, 2023

Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data

Python 317 52 Updated Mar 21, 2023

Advantage weighted Actor Critic for Offline RL

Python 50 8 Updated Aug 27, 2022

Tasks to get you started with MineRL

Python 40 7 Updated Jan 6, 2025

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 21,536 2,793 Updated Aug 15, 2024

From search engines, to science, to robotics, this reposity is meant to showcase the use of reinforcement learning in the world..

223 29 Updated Jun 16, 2024

An offline deep reinforcement learning library

Python 1,402 246 Updated Mar 7, 2025

Collection of reinforcement learning algorithms

Python 2,609 557 Updated Jun 17, 2024

Massively parallel rigidbody physics simulation on accelerator hardware.

Jupyter Notebook 2,562 272 Updated Feb 5, 2025

SAPIEN Embodied AI Platform

C++ 506 45 Updated Mar 3, 2025

A PyTorch implementation of Implicit Q-Learning

Python 74 10 Updated Oct 23, 2021
Next