denisyarats
ExORL: Exploratory Data for Offline Reinforcement Learning

Python 107 9 Updated Feb 8, 2022
Python 337 54 Updated Oct 12, 2022

DrQ-v2: Improved Data-Augmented Reinforcement Learning

Python 373 88 Updated May 31, 2022

JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

Jupyter Notebook 650 69 Updated Oct 26, 2022

The Differentiable Cross-Entropy Method

Jupyter Notebook 125 12 Updated Aug 14, 2020

Automatic Data-Regularized Actor-Critic (Auto-DrAC)

Python 102 17 Updated Mar 24, 2023

DrQ: Data regularized Q

Jupyter Notebook 412 53 Updated Jan 13, 2023

higher is a PyTorch library that lets users obtain higher-order gradients over losses spanning training loops rather than individual training steps.

Python 1,603 124 Updated Mar 25, 2022

PyTorch implementation of Soft Actor-Critic + Autoencoder (SAC+AE)

Python 237 34 Updated May 3, 2020

We release the dataset collected for our research, code that implements the neural network models described in the paper, scripts to reproduce all of our results, and a visualization tool for visualize da…

C++ 161 29 Updated Sep 2, 2021

bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent

Python 1,515 186 Updated Apr 13, 2024

Turn Python scripts into handouts with Markdown and figures

Python 2,018 106 Updated Aug 28, 2021

Implementations of quasi-hyperbolic optimization algorithms.

Python 102 11 Updated May 15, 2020

LSTM and QRNN Language Model Toolkit for PyTorch

Python 1,966 488 Updated Feb 12, 2022

Starcraft AI Research Dataset

Python 572 75 Updated Aug 30, 2021

Submanifold sparse convolutional networks

C++ 2,085 332 Updated Jan 9, 2024

An End-To-End, Lightweight and Flexible Platform for Game Research

C++ 2,085 284 Updated Aug 30, 2021

InferSent sentence embeddings

Jupyter Notebook 2,285 470 Updated Aug 30, 2021

torch TH/THC c++11 wrapper

C 14 3 Updated Jun 14, 2017

Deal or No Deal? End-to-End Learning for Negotiation Dialogues

Python 1,379 278 Updated May 4, 2020

Facebook AI Research Sequence-to-Sequence Toolkit

Lua 3,742 614 Updated Sep 17, 2021

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Python 10,498 2,094 Updated Nov 3, 2023

stochs: fast stochastic solvers for machine learning in C++ and Cython

C++ 26 8 Updated Oct 13, 2022

A library for efficient similarity search and clustering of dense vectors.

C++ 32,538 3,717 Updated Jan 24, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 86,152 23,183 Updated Jan 24, 2025

Torch implementation of DeepMask and SharpMask

Lua 3,114 506 Updated Jan 16, 2019

A toolkit for making real world machine learning and data analysis applications in C++

C++ 13,719 3,389 Updated Jan 24, 2025

header only, dependency-free deep learning framework in C++14

C++ 5,875 1,384 Updated Apr 17, 2022

Generative adversarial networks (GAN) applied to sequential data via recurrent neural networks (RNN).

Python 395 124 Updated Jun 19, 2017

Theano implementation of Tree RNNs aka Recursive Neural Networks.

Python 236 50 Updated Aug 15, 2016