Stars
Dynamic Memory Management for Serving LLMs without PagedAttention
Samples for CUDA developers that demonstrate features in the CUDA Toolkit
Convolutional neural networks on Ninapro datasets
This project uses IMU sensor data fusion with Machine Learning to implement a gesture recognition algorithm.
Basic Gesture Recognition Using mmWave Sensor - TI AWR1642
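Many container images, such as those on gcr, are hosted outside China, so downloads from within China are slow and need acceleration. This project aims to provide a stable, reliable, and secure container image service that connects to the whole world.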
Repository to demo GPU Sharing with Time Slicing, MPS, MIG and others
Serverless LLM Serving for Everyone.
User documentation for Knative components.
The official GitHub page for the survey paper "A Survey of Large Language Models".
The GeoDataViz Toolkit is a set of resources that will help you communicate your data effectively through the design of compelling visuals. In this repository we are sharing resources, assets and o…
llama3 implementation one matrix multiplication at a time
Large Language Model (LLM) Systems Paper List
The See Yue theme series is a minimalist Typora theme with a large number of custom styles and careful attention to detail.
Disaggregated serving system for Large Language Models (LLMs).
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)
Standardized Serverless ML Inference Platform on Kubernetes
Simple, safe way to store and distribute tensors
A Cross-Platform, Multi-Cloud High-Performance Computing Platform