CMU - Pittsburgh, US - https://stiglidu.github.io/
Stars
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
- An easy-to-use, scalable, and high-performance RLHF framework (70B+ PPO full tuning, iterative DPO, LoRA, RingAttention, and RFT)
- Super-efficient RLHF training of LLMs with parameter reallocation
- 🔍 An LLM-based multi-agent framework for web search engines (like Perplexity.ai Pro and SearchGPT)
- 🙌 OpenHands: Code Less, Make More
- SGLang is a fast serving framework for large language models and vision language models.
- Meta Lingua: a lean, efficient, and easy-to-hack codebase for LLM research.
- Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods, covering single- and multi-node GPU setups. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…
- A generative world for general-purpose robotics & embodied AI learning.
- Simple and efficient PyTorch-native transformer training and inference (batched)
- [NeurIPS D&B Track 2024] Source code for the paper "Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge"
- Entropy-based sampling and parallel CoT decoding
- ORLM: Training Large Language Models for Optimization Modeling
- g1: Using Llama 3.1 70B on Groq to create o1-like reasoning chains
- Repository for the paper "Stream of Search: Learning to Search in Language"
- A framework for few-shot evaluation of language models.
- Code for the paper "Evaluating Large Language Models Trained on Code"
- RewardBench: the first evaluation tool for reward models.
- A high-throughput and memory-efficient inference and serving engine for LLMs
- A curated list of fellowships for graduate students in Computer Science and related fields.
- Dataset and benchmark for assessing LLMs in translating natural-language descriptions of planning problems into PDDL
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" by Zhiheng Xi et al.
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision