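Latest Claude Pro subscription tutorial: How do you register a Claude account? How do you subscribe to Claude Pro? How do you purchase a native, standalone Claude Pro account? How do you top up your existing Claude account? (Includes a tutorial on using Claude Code from within China)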

452 27 Updated Sep 6, 2025

An acceleration library that supports arbitrary bit-width combinatorial quantization operations

C++ 232 21 Updated Sep 30, 2024

Low-bit LLM inference on CPU/NPU with lookup table

C++ 853 70 Updated Jun 5, 2025

Awesome LLM compression research papers and tools.

1,660 107 Updated Jul 2, 2025

We want to create a repo to illustrate the usage of transformers in Chinese

Shell 2,973 488 Updated Aug 18, 2024

A powerful toolkit for compressing large models including LLM, VLM, and video generation models.

Python 563 63 Updated Aug 22, 2025

🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools

Python 3,083 594 Updated Sep 5, 2025

LLM inference in C/C++

C++ 86,480 13,024 Updated Sep 14, 2025

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 12,137 2,247 Updated Sep 10, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 11,577 1,742 Updated Sep 14, 2025

Multilingual Medicine: Model, Dataset, Benchmark, Code

Python 194 8 Updated Oct 15, 2024