Lists (1)
Sort Name ascending (A-Z)
Stars
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Deal or No Deal? End-to-End Learning for Negotiation Dialogues
800,000 step-level correctness labels on LLM solutions to MATH problems
AgentLLM is a PoC for browser-native autonomous agents
With this WhatsApp app, you can chat in any language with a Large language models (LLM) on Amazon Bedrock. Send voice notes and receive transcriptions. By making a minor change in the code, you can…
Convert Compute And Books Into Instruct-Tuning Datasets! Makes: QA, RP, Classifiers.
Robust recipes to align language models with human and AI preferences
Efficient Training (including pre-training and fine-tuning) for Big Models
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
A quick guide (especially) for trending instruction finetuning datasets
This repo contains the code for generating the ToxiGen dataset, published at ACL 2022.
Distributionally robust neural networks for group shifts
Code and documentation to train Stanford's Alpaca models, and generate the data.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the user to interact with images during a conversation.
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
PyTorch extensions for high performance and large scale training.
[NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training
Crawl & Visualize ICLR 2023 Data from OpenReview
Tactics2D: A Reinforcement Learning Environment Library with Generative Scenarios for Driving Decision-making
[AAAI 2024] NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario.
LimSim & LimSim++: Integrated traffic and autonomous driving simulators with (M)LLM support