brisker

cccpr brisker

Achievements

NVIDIA/TensorRT-LLM NVIDIA/TensorRT-LLM Public

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 14.2k 2.6k
OpenGVLab/OmniQuant OpenGVLab/OmniQuant Public

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Python 903 84