View lt2000's full-sized avatar

Dynamic Memory Management for Serving LLMs without PagedAttention

C 273 20 Updated Dec 6, 2024
Python 15 Updated Jan 3, 2025

Samples for CUDA developers that demonstrate features in the CUDA Toolkit

C 6,780 1,896 Updated Jul 26, 2024

Convolutional Neural Networks On Ninapro datasets

Python 18 6 Updated Nov 28, 2017

This project uses IMU sensor data fusion with Machine Learning to implement a gesture recognition algorithm.

Jupyter Notebook 8 Updated Dec 6, 2022

Basic Gesture Recognition Using mmWave Sensor - TI AWR1642

Python 123 22 Updated Sep 12, 2024

Microsoft Azure Traces

Jupyter Notebook 868 148 Updated Dec 12, 2024

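Many container images are hosted outside China (gcr, for example), so downloads within China are slow and need acceleration. This project aims to provide a stable, reliable, and secure container image service that connects the whole world.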

Shell 7,912 1,005 Updated Jan 21, 2025
Jupyter Notebook 48 4 Updated Jun 13, 2024

Repository to demo GPU Sharing with Time Slicing, MPS, MIG and others

Jupyter Notebook 30 7 Updated Oct 17, 2024

Serverless LLM Serving for Everyone.

Python 404 37 Updated Jan 20, 2025
Python 10 6 Updated May 28, 2024

User documentation for Knative components.

JavaScript 4,613 1,238 Updated Jan 21, 2025
Python 11 2 Updated Jan 12, 2024

NVIDIA device plugin for Kubernetes

Go 2,969 652 Updated Jan 19, 2025

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 10,848 848 Updated Aug 20, 2024

The GeoDataViz Toolkit is a set of resources that will help you communicate your data effectively through the design of compelling visuals. In this repository we are sharing resources, assets and o…

391 61 Updated Dec 3, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 14,048 1,146 Updated May 23, 2024
Jupyter Notebook 12 2 Updated Jun 26, 2024

Large Language Model (LLM) Systems Paper List

739 26 Updated Jan 19, 2025

The See Yue theme series is a minimalist, detail-rich Typora theme with a large number of custom styles.

CSS 213 20 Updated May 1, 2023

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook 445 50 Updated Aug 19, 2024

The most user-friendly V2Ray one-click installation script and management script

Shell 25,300 16,314 Updated Dec 5, 2024

AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)

Python 80 12 Updated Jul 14, 2023

Standardized Serverless ML Inference Platform on Kubernetes

Python 3,799 1,091 Updated Jan 21, 2025

Simple, safe way to store and distribute tensors

Python 3,013 206 Updated Jan 9, 2025
C 1 1 Updated Apr 21, 2021
C 15 5 Updated Sep 9, 2024

A Cross-Platform, Multi-Cloud High-Performance Computing Platform

C 251 113 Updated Jan 6, 2025