michaelzhiluo

Michael Luo michaelzhiluo

UC Berkeley PhD

85 followers · 6 following

UC Berkeley
www.michaelzhiluo.com

Stars

agentica-project / deepscaler

Democratizing Reinforcement Learning for LLMs

Python 1,374 107 Updated Feb 13, 2025

StructuredLabs / preswald

🐵 Preswald is a full-stack platform for building, deploying, and managing interactive data applications. It brings ingestion, storage, transformation, and visualization into a simple SDK, minimizin…

Python 1,255 47 Updated Feb 15, 2025

Jiayi-Pan / TinyZero

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,049 1,297 Updated Feb 1, 2025

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 11,371 2,549 Updated Feb 15, 2025

amazon-science / PAE

Python 45 2 Updated Feb 11, 2025

openreasoner / openr

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,617 127 Updated Jan 17, 2025

deepseek-ai / DeepSeek-R1

75,370 9,748 Updated Feb 14, 2025

deepseek-ai / DeepSeek-V3

Python 84,575 13,568 Updated Feb 14, 2025

uccl-project / uccl

Ultra | Ultimate | Unified CCL

C++ 30 2 Updated Feb 14, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 9,626 915 Updated Feb 15, 2025

volcengine / verl

veRL: Volcano Engine Reinforcement Learning for LLM

Python 3,150 267 Updated Feb 15, 2025

PRIME-RL / PRIME

Scalable RL solution for advanced reasoning of language models

Python 1,229 78 Updated Feb 4, 2025

SimpleBerry / LLaMA-O1

Large Reasoning Models

Python 802 44 Updated Dec 3, 2024

kohjingyu / search-agents

Code for the paper 🌳 Tree Search for Language Model Agents

Python 175 20 Updated Jul 25, 2024

lapisrocks / LanguageAgentTreeSearch

[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"

Python 729 74 Updated Jul 30, 2024

DigiRL-agent / digirl

Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.

Python 303 23 Updated Nov 26, 2024

google-research / google-research

Google Research

Jupyter Notebook 34,900 8,012 Updated Feb 11, 2025

namin / llm-verified-with-monte-carlo-tree-search

LLM verified with Monte Carlo Tree Search

Jupyter Notebook 263 27 Updated Feb 7, 2025

codelion / optillm

Optimizing inference proxy for LLMs

Python 2,027 158 Updated Feb 14, 2025

langchain-ai / langgraph

Build resilient language agents as graphs.

Python 9,012 1,480 Updated Feb 15, 2025

efeslab / Nanoflow

A throughput-oriented high-performance serving framework for LLMs

Cuda 733 29 Updated Sep 21, 2024

HazyResearch / eclair-agents

Automating enterprise workflows with multimodal agents

Jupyter Notebook 99 14 Updated Oct 9, 2024

spcl / graph-of-thoughts

Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"

Python 2,279 166 Updated Dec 11, 2024

haoliuhl / ringattention

Large Context Attention

Python 681 53 Updated Jan 24, 2025

Ag2S1 / Sibyl-System

Python 112 9 Updated Aug 13, 2024

microsoft / vscode

Visual Studio Code

TypeScript 167,350 30,533 Updated Feb 15, 2025

coder / code-server

VS Code in the browser

TypeScript 69,795 5,767 Updated Feb 14, 2025

microsoft / ParrotServe

[OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable

Python 143 8 Updated Sep 21, 2024

zou-group / textgrad

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Python 2,076 176 Updated Jan 28, 2025

tyler-griggs / melange-release

Python 43 5 Updated Jun 27, 2024
