Stars
Code for the paper Fine-Tuning Language Models from Human Preferences
A generative world for general-purpose robotics & embodied AI learning.
Implementation of TRPO and related algorithms
A toolkit for reproducible reinforcement learning research.
rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
A connector for Claude Desktop to work with collection and sources on your Zotero Cloud.
The official Python SDK for Model Context Protocol servers and clients
An AI web browsing framework focused on simplicity and extensibility.
NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retri…
SGLang is a fast serving framework for large language models and vision language models.
Python tool for converting files and office documents to Markdown.
Official implementation: Large Language Models are Interpretable Learners - Google
Neon: Serverless Postgres. We separated storage and compute to offer autoscaling, code-like database branching, and scale to zero.
Making Long-Context LLM Inference 10x Faster and 10x Cheaper
Repository for most of the code from my YouTube channel
A library of reinforcement learning components and agents
Multi-Joint dynamics with Contact. A general purpose physics simulator.
An educational resource to help anyone learn deep reinforcement learning.
Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"