Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
Mastering Diverse Domains through World Models
Train transformer language models with reinforcement learning.
Schedule-Free Optimization in PyTorch
Fine-tune LLM agents with online reinforcement learning
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
Reference implementation for DPO (Direct Preference Optimization)
FastAPI framework, high performance, easy to learn, fast to code, ready for production
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" presented by Zhiheng Xi et al.
Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various c…
Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.
Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.
Data for paper "Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness"
Sample code illustrating the VS Code extension API.
🦌 Soothing pastel theme for VSCode & Azure Data Studio
An innovative superfamily of fonts for code
A Redis Plugin for GenKit that adds Redis for efficient state storage, trace storage, caching, and rate limiting.
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
A framework for few-shot evaluation of language models.