-
vidur Public
Forked from microsoft/vidurA large-scale simulation framework for LLM inference
-
-
ns-3-alibabacloud Public
Forked from aliyun/ns-3-alibabacloud -
sarathi-serve Public
Forked from microsoft/sarathi-serveA low-latency & high-throughput serving engine for LLMs
-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedDec 16, 2024 -
-
apex Public
Forked from NVIDIA/apexA PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
-
-
-
KVSharer Public
Forked from yangyifei729/KVSharerSource code of paper ''KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing''
-
transformer-explainer Public
Forked from poloclub/transformer-explainerTransformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
JavaScript MIT License UpdatedOct 15, 2024 -
LLMServingSim Public
Forked from casys-kaist/LLMServingSimLLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale
Python MIT License UpdatedAug 1, 2024 -
-
GenZ-LLM-Analyzer Public
Forked from abhibambhaniya/GenZ-LLM-AnalyzerLLM Inference analyzer for different hardware platforms
-
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
-
astrasim Public
Forked from zzudongxiang/astrasimASTRA-sim 是一个分布式机器学习系统模拟器。它可以系统地研究现代深度学习系统所面临的挑战,探索瓶颈问题,并为未来不同平台上开发大型 DNN 模型提供高效的方法。
UpdatedJun 25, 2024 -
astra-sim Public
Forked from astra-sim/astra-simASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale
C++ MIT License UpdatedJun 13, 2024 -
-
PALM Public
Forked from fangjh21/PALMPALM: A Efficient Performance Simulator for Tiled Accelerators with Large-scale Model Training
Python UpdatedJun 7, 2024 -
-
ElasticFlow Public
Forked from pkusys/ElasticFlowArtifacts for our ASPLOS'23 paper ElasticFlow
Python Apache License 2.0 UpdatedMay 28, 2024 -
llama Public
Forked from meta-llama/llamaInference code for Llama models
Python Other UpdatedMay 15, 2024 -
-
transformers-code Public
Forked from zyds/transformers-code手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube
Jupyter Notebook UpdatedApr 6, 2024 -
-
-
chatgpt-web-share Public
Forked from chatpire/chatgpt-web-shareChatGPT Plus 共享方案。ChatGPT Plus / OpenAI API sharing solution.
Vue GNU General Public License v3.0 UpdatedDec 1, 2023 -
-
Awesome-LLM-System-Papers Public
Forked from AmadeusChan/Awesome-LLM-System-PapersUpdatedOct 31, 2023 -
Paddle Public
Forked from PaddlePaddle/PaddlePArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
C++ Apache License 2.0 UpdatedSep 11, 2023