mlsys
Here are 44 public repositories matching this topic...
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑‍💻 Video Tutorials.
Updated Jul 25, 2025
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Updated Nov 17, 2025 - Python
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
Updated Nov 21, 2025 - Python
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves a 2-5x speedup over FlashAttention without losing end-to-end metrics across language, image, and video models.
Updated Nov 6, 2025 - Cuda
[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.
Updated Nov 18, 2025 - Cuda
FedScale is a scalable and extensible open-source federated learning (FL) platform.
Updated Dec 18, 2023 - Python
Measure and optimize the energy consumption of your AI applications!
Updated Nov 17, 2025 - Python
A curated collection of noteworthy MLSys bloggers (Algorithms/Systems)
Updated Jan 5, 2025 - HTML
Machine Learning Framework for Operating Systems - Brings ML to Linux kernel
Updated Dec 13, 2021 - C
An acceleration library that supports arbitrary bit-width combinatorial quantization operations
Updated Sep 30, 2024 - C++
🤖FFPA: Extend FlashAttention-2 with Split-D, ~O(1) SRAM complexity for large headdim, 1.8x~3x↑🎉 vs SDPA EA.
Updated Nov 18, 2025 - Cuda
A ChatGPT (GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems
Updated Jul 24, 2025 - Python
A scalable & efficient active learning/data selection system for everyone.
Updated Jul 8, 2024 - Python
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention
Updated Nov 12, 2025 - Python
[Survey] Towards Efficient Large Language Model Serving: A Survey on System-Aware KV Cache Optimization
Updated Nov 10, 2025 - Python
Optimal Sparse Decision Trees
Updated Apr 27, 2023 - Python