MachineLearningSystem
Popular repositories Loading
-
25ASPLOS-Medusa
25ASPLOS-Medusa PublicForked from thustorage/Medusa
Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]
-
24MLSYS-prompt-cache
24MLSYS-prompt-cache PublicForked from yale-sys/prompt-cache
Modular and structured prompt caching for low-latency LLM inference
Python 8
-
-
Optimus-CC
Optimus-CC Public[ASPLOS'23] Optimus-CC: Efficient Large NLP Model Training with 3D Parallelism Aware Communication Compression
-
-
Awesome-DL-Scheduling-Papers
Awesome-DL-Scheduling-Papers PublicForked from S-Lab-System-Group/Awesome-DL-Scheduling-Papers
Repositories
- VILA Public Forked from NVlabs/VILA
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
MachineLearningSystem/VILA’s past year of commit activity - 25ISCA-LIA_AMXGPU Public Forked from Hyungyo1/LIA_AMXGPU
[ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading
MachineLearningSystem/25ISCA-LIA_AMXGPU’s past year of commit activity - 25ATC-crosspipe Public Forked from spcl/crosspipe
Ongoing research training transformer models at scale
MachineLearningSystem/25ATC-crosspipe’s past year of commit activity - 25ATC-PathWeaver Public Forked from AIS-SNU/PathWeaver
A High-Throughput Multi-GPU System for Graph-Based Approximate Nearest Neighbor Search
MachineLearningSystem/25ATC-PathWeaver’s past year of commit activity - 25ATC-Katz Public Forked from modelscope/Katz
[ATC'25] Katz is a high-performance serving system designed specifically for diffusion model workflows with multiple adapters.
MachineLearningSystem/25ATC-Katz’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…