- USTC
- Hefei, China
- https://guopeng-gpli.github.io/
Highlights
- Pro
skypilot Public
Forked from skypilot-org/skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Python Apache License 2.0 Updated Dec 29, 2024
sglang Public
Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 Updated Dec 23, 2024
leetcode Public
Forked from doocs/leetcode
🔥 LeetCode solutions in any programming language, including solutions for 《剑指 Offer》 (2nd edition) and Cracking the Coding Interview (6th edition)
Java Creative Commons Attribution Share Alike 4.0 International Updated Dec 17, 2024
ServerlessLLM Public
Forked from ServerlessLLM/ServerlessLLM
Serverless LLM Serving for Everyone.
Python Apache License 2.0 Updated Nov 18, 2024
LMCache Public
Forked from LMCache/LMCache
Prefill LLMs only once, re-use KV across instances
Python Apache License 2.0 Updated Oct 27, 2024
data-release Public
Forked from sir-lab/data-release
Huawei Cloud datasets
Jupyter Notebook Updated Oct 21, 2024
PowerInfer Public
Forked from SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
C++ MIT License Updated Jul 15, 2024
dLoRA-artifact Public
Forked from LLMServe/dLoRA-artifact
Jupyter Notebook Apache License 2.0 Updated May 28, 2024
herald Public
Forked from HKUST-SING/herald
Herald: Accelerating Neural Recommendation Training with Embedding Scheduling (NSDI 2024)
Python Apache License 2.0 Updated May 9, 2024
AcmeTrace Public
Forked from InternLM/AcmeTrace
Jupyter Notebook Creative Commons Attribution 4.0 International Updated Mar 12, 2024
semantic-kernel-docs Public
Forked from MicrosoftDocs/semantic-kernel-docs
Semantic Kernel (SK) is a lightweight SDK enabling integration of AI Large Language Models (LLMs) with conventional programming languages.
MIT License Updated Dec 11, 2023
MetaGPT Public
Forked from geekan/MetaGPT
🌟 The Multi-Agent Framework: Given one line Requirement, return PRD, Design, Tasks, Repo
Python MIT License Updated Oct 24, 2023
bagpipe Public
Forked from uw-mad-dash/bagpipe
Code for reproducing results for SOSP paper Bagpipe
Python MIT License Updated Oct 20, 2023
executorch Public
Forked from pytorch/executorch
End-to-end solution for enabling on-device AI across mobile and edge devices for PyTorch models
C++ Other Updated Oct 18, 2023
GPTCache Public
Forked from zilliztech/GPTCache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
Python MIT License Updated Aug 15, 2023
llama Public
Forked from meta-llama/llama
Inference code for LLaMA models
Python Other Updated Jul 24, 2023
llm-caching-multiplexing Public
Forked from Ying1123/llm-caching-multiplexing
Jupyter Notebook Apache License 2.0 Updated Jun 3, 2023
faas-cli Public
Forked from openfaas/faas-cli
Official CLI for OpenFaaS
Go Other Updated Nov 18, 2022
faas Public
Forked from openfaas/faas
OpenFaaS - Serverless Functions Made Simple
Go MIT License Updated Oct 24, 2022
faasd Public
Forked from openfaas/faasd
A lightweight & portable faas engine
Go MIT License Updated Oct 24, 2022
YCSB Public
Forked from brianfrankcooper/YCSB
Yahoo! Cloud Serving Benchmark
Java Apache License 2.0 Updated Jul 1, 2022
EdgeFaaSBench Public
Forked from kaustubhrajput46/EdgeFaaSBench
Jupyter Notebook Updated May 23, 2022
AI-System Public
Forked from microsoft/AI-System
System for AI Education Resource.
Python Other Updated Mar 25, 2022
datasets Public
Forked from kaustubhrajput46/datasets
Needed datasets will be added here.
Updated Jul 29, 2021
Delayed-Hits Public
Forked from cmu-snap/Delayed-Hits
Artifacts for the "Caching with Delayed Hits" paper as it appears in SIGCOMM '20.
mlcache Public
Forked from jcpazos/mlcache
MLCache: A Multi-Armed Bandit Policy for an Operating System Page Cache
C Updated Apr 19, 2021