Skip to content
View woshiyyya's full-sized avatar
zzz
zzz

Highlights

  • Pro

Organizations

@ray-project @ACM-Class-2016

Block or report woshiyyya

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Efficient Triton Kernels for LLM Training

Python 4,620 281 Updated Mar 11, 2025

Large Language Model (LLM) Systems Paper List

809 32 Updated Mar 11, 2025

An Open Source Toolkit For LLM Distillation

Python 531 60 Updated Jan 7, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 2,357 245 Updated Mar 12, 2025

PyTorch native quantization and sparsity for training and inference

Python 1,898 228 Updated Mar 12, 2025
Python 1,293 184 Updated Mar 10, 2025

LLM training code for Databricks foundation models

Python 4,175 548 Updated Mar 12, 2025

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,000 243 Updated Mar 7, 2025

Code release for "Learning Video Representations from Large Language Models"

Python 510 45 Updated Oct 1, 2023

CoreNet: A library for training deep neural networks

Jupyter Notebook 7,003 545 Updated Oct 14, 2024

Grok open release

Python 50,234 8,367 Updated Aug 30, 2024

Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.

Python 15,270 1,598 Updated Mar 11, 2025

A general and accurate MACs / FLOPs profiler for PyTorch models

Python 600 42 Updated May 5, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,337 1,006 Updated Nov 18, 2024

A PyTorch Native LLM Training Framework

Python 750 40 Updated Dec 27, 2024

PyTorch - FID calculation with proper image resizing and quantization steps [CVPR 2022]

Python 1,029 72 Updated Jul 23, 2024

Minimalistic large language model 3D-parallelism training

Python 1,680 163 Updated Mar 10, 2025

A GPipe implementation in PyTorch

Python 835 98 Updated Jul 25, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 81,304 11,922 Updated Mar 12, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 16,428 2,368 Updated Mar 11, 2025

Latency and Memory Analysis of Transformer Models for Training and Inference

Python 398 46 Updated Mar 4, 2025

DLRover: An Automatic Distributed Deep Learning System

Python 1,364 171 Updated Mar 11, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 38,087 4,654 Updated Mar 1, 2025

It counts how many times your GitHub profile has been viewed. Free cloud micro-service.

PHP 4,233 381 Updated Dec 4, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 41,223 6,214 Updated Mar 12, 2025

Tune efficiently any LLM model from HuggingFace using distributed training (multiple GPU) and DeepSpeed. Uses Ray AIR to orchestrate the training on multiple AWS GPU instances

Python 55 6 Updated Jun 20, 2023

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,463 1,047 Updated Mar 12, 2025

Flexible components pairing 🤗 Transformers with ⚡ Pytorch Lightning

Python 611 76 Updated Nov 21, 2022

RayLLM - LLMs on Ray

Python 1,261 93 Updated May 28, 2024

30天自制C++服务器,包含教程和源代码

C++ 6,180 788 Updated Jan 21, 2025
Next