mitultiwari
Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,269 163 Updated Jul 25, 2023

A generative world for general-purpose robotics & embodied AI learning.

Python 23,168 1,927 Updated Jan 22, 2025

Implementation of TRPO and related algorithms

Python 624 157 Updated May 20, 2018

A toolkit for reproducible reinforcement learning research.

Python 1,910 310 Updated May 4, 2023

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

Python 2,929 801 Updated Jun 10, 2023

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 15,956 4,880 Updated Aug 1, 2024

Assignments for CS294-112.

Python 1,578 1,043 Updated Mar 24, 2023

A collection of MCP servers.

1,964 195 Updated Jan 20, 2025

A connector for Claude Desktop to work with collection and sources on your Zotero Cloud.

TypeScript 10 5 Updated Dec 20, 2024

The official Python SDK for Model Context Protocol servers and clients

Python 1,475 150 Updated Jan 22, 2025

An AI web browsing framework focused on simplicity and extensibility.

TypeScript 5,307 230 Updated Jan 22, 2025

Python SDK for Browserbase

Python 11 1 Updated Jan 22, 2025

Deprecated Browserbase Python SDK

Python 10 3 Updated Nov 1, 2024
TeX 132 26 Updated Mar 7, 2018

NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retri…

Python 2,284 192 Updated Jan 21, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 7,559 731 Updated Jan 22, 2025

Python tool for converting files and office documents to Markdown.

Python 35,355 1,567 Updated Jan 16, 2025

Official implementation: Large Language Models are Interpretable Learners - Google

10 1 Updated Jun 29, 2024

Neon: Serverless Postgres. We separated storage and compute to offer autoscaling, code-like database branching, and scale to zero.

Rust 15,886 476 Updated Jan 22, 2025

Making Long-Context LLM Inference 10x Faster and 10x Cheaper

Python 373 39 Updated Jan 22, 2025
Jupyter Notebook 162 222 Updated Dec 6, 2024

Probabilistic Machine Learning: Advanced Topics

1,426 120 Updated Nov 26, 2024

Repository for most of the code from my YouTube channel

Python 886 485 Updated Jul 24, 2023

A library of reinforcement learning components and agents

Python 3,567 442 Updated Jan 14, 2025

Multi-Joint dynamics with Contact. A general purpose physics simulator.

Jupyter Notebook 8,606 871 Updated Jan 22, 2025

An educational resource to help anyone learn deep reinforcement learning.

Python 10,381 2,264 Updated Aug 5, 2024

Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"

Python 1,579 278 Updated Oct 31, 2019