Abatom
  • xiaomi
  • Beijing

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 2,271 149 Updated Dec 23, 2024

CUDA Templates for Linear Algebra Subroutines

C++ 5,889 1,023 Updated Dec 25, 2024

MegEngine is a fast, scalable, easy-to-use deep learning framework with support for automatic differentiation

C++ 4,778 542 Updated Oct 24, 2024

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Python 5,723 521 Updated Dec 14, 2024

Applied AI experiments and examples for PyTorch

Python 193 18 Updated Dec 17, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 4,946 448 Updated Dec 27, 2024

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,595 463 Updated Nov 21, 2024
Python 4 Updated Aug 21, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 6 Updated Dec 27, 2024

Development repository for the Triton language and compiler

C++ 13,819 1,687 Updated Dec 27, 2024

Serving multiple LoRA fine-tuned LLMs as one

Python 1,006 46 Updated May 8, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 2,276 131 Updated Dec 26, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 32,660 4,971 Updated Dec 27, 2024

Ongoing research training transformer models at scale

Python 10,928 2,444 Updated Dec 23, 2024

Face Depixelizer based on "PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models" repository.

Jupyter Notebook 2,056 249 Updated Apr 30, 2021

A Deep Learning based project for colorizing and restoring old images (and video!)

Python 18,091 2,586 Updated Oct 19, 2024

StyleGAN2 - Official TensorFlow Implementation

Python 11,017 2,528 Updated May 18, 2024

CUDA Kernel Benchmarking Library

Cuda 539 69 Updated Nov 20, 2024

An unidentifiable mechanism that helps you bypass GFW.

C++ 19,034 3,050 Updated Aug 21, 2024

⭐ Cross-platform V2Ray client for Linux / Windows / macOS | Supports VMess / VLESS / SSR / Trojan / Trojan-Go / NaiveProxy / HTTP / HTTPS / SOCKS5 | Developed with C++ / Qt | Extensible plugin-based design ⭐

C++ 16,772 3,264 Updated Sep 3, 2024

V2Ray nodes, free nodes, free V2Ray nodes, latest public free V2Ray node subscription addresses, free V2Ray nodes updated daily, free SS/V2Ray/Trojan nodes, freefq

6,815 449 Updated Dec 27, 2024

A web GUI client of Project V which supports VMess, VLESS, SS, SSR, Trojan, Tuic and Juicity protocols. 🚀

Go 11,845 1,246 Updated Dec 19, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 1,630 163 Updated Dec 25, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 85,311 22,978 Updated Dec 27, 2024

Universal markup converter

Haskell 35,201 3,404 Updated Dec 23, 2024

A V2Ray client for Android that supports Xray core and v2fly core

Kotlin 37,276 5,649 Updated Dec 25, 2024

A GUI client for Windows, Linux and macOS that supports Xray core, sing-box-core and others

C# 72,407 11,843 Updated Dec 27, 2024

Animation engine for explanatory math videos

Python 72,509 6,349 Updated Dec 26, 2024

Acode - powerful text/code editor for Android

JavaScript 2,988 406 Updated Dec 25, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,359 163 Updated Jun 25, 2024