Stars
A list of awesome academic researches and industrial materials about Large Language Model (LLM) and Artificial Intelligence for IT Operations (AIOps).
A Datacenter Scale Distributed Inference Serving Framework
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
DeepEP: an efficient expert-parallel communication library
KusionStack & KCL 2024 Annual Report
Frame for controllers managing resource in/beyond Cluster, and offering the ability of following PodOpsLifecycle.
Operator sharding, canary, circuitbreaker and more...
Manage k8s resources effectively with risk under control.
A Kubernetes Operator that automates the deployment of Pulumi Stacks
Intelligence for Kubernetes. World's most promising Kubernetes Visualization Tool for Developer and Platform Engineering teams.
Declarative Intent Driven Platform Orchestrator for Internal Developer Platform (IDP).
This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.