Skip to content
View StigLidu's full-sized avatar
🤪
Crazy !!!
🤪
Crazy !!!

Highlights

  • Pro

Block or report StigLidu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,607 248 Updated Dec 27, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 3,706 356 Updated Jan 13, 2025

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Python 191 10 Updated Dec 30, 2024

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

JavaScript 5,695 584 Updated Jan 8, 2025

🙌 OpenHands: Code Less, Make More

Python 43,128 4,766 Updated Jan 13, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 7,256 695 Updated Jan 13, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,385 227 Updated Jan 10, 2025

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…

Jupyter Notebook 15,877 2,311 Updated Jan 10, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 22,513 1,829 Updated Jan 12, 2025

Simple and efficient pytorch-native transformer training and inference (batched)

Python 66 4 Updated Apr 2, 2024

LLM inference in C/C++

C++ 70,613 10,194 Updated Jan 12, 2025

[NeurIPS D&B Track 2024] Source code for the paper "Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge"

Python 11 Updated Nov 5, 2024

Entropy Based Sampling and Parallel CoT Decoding

Python 3,189 316 Updated Nov 13, 2024

ORLM: Training Large Language Models for Optimization Modeling

Python 84 12 Updated Nov 4, 2024

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,142 376 Updated Dec 6, 2024

Repository for the paper Stream of Search: Learning to Search in Language

Python 117 14 Updated Aug 10, 2024

A framework for few-shot evaluation of language models.

Python 7,433 1,999 Updated Jan 10, 2025

Code for the paper "Evaluating Large Language Models Trained on Code"

Python 2,504 356 Updated Feb 5, 2024

RewardBench: the first evaluation tool for reward models.

Python 486 56 Updated Jan 8, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 33,600 5,134 Updated Jan 13, 2025
Python 228 11 Updated Jan 11, 2025

OptiBench and ReSocratic Synthesis Method

Python 12 Updated Oct 8, 2024

A curated list of fellowships for graduate students in Computer Science and related fields.

583 61 Updated Nov 10, 2024
Python 75 129 Updated Sep 24, 2024

Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL

Python 48 4 Updated Oct 16, 2024
SAS 383 34 Updated Sep 27, 2023

Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" presented by Zhiheng Xi et al.

Python 84 6 Updated Feb 9, 2024

The MATH Dataset (NeurIPS 2021)

Python 977 88 Updated Aug 5, 2024

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

Python 109 11 Updated Sep 9, 2024
Next