Korea University
[OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable
Vim plugin for syntax-aware code formatting
SGLang is a fast serving framework for large language models and vision language models.
Let ChatGPT teach your own chatbot in hours with a single GPU!
Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently.
Development repository for the Triton language and compiler
Fast and memory-efficient exact attention
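The entry above refers to FlashAttention, whose core trick is computing softmax incrementally over tiles with a running maximum and rescaled running sum, so the full score matrix never needs to be materialized. A minimal sketch of that online-softmax idea in plain Python (the `chunk` parameter and function name are illustrative, not from the library):

```python
import math

def online_softmax(xs, chunk=2):
    """Streaming softmax over fixed-size chunks: keep a running max m
    and a running sum s of exp(x - m), rescaling s whenever m grows.
    This is the numerical core of tiled/flash-style attention."""
    m = float("-inf")   # running max seen so far
    s = 0.0             # running sum of exp(x - m)
    for i in range(0, len(xs), chunk):
        block = xs[i:i + chunk]
        m_new = max(m, max(block))
        # rescale the old sum to the new max, then fold in this block
        s = s * math.exp(m - m_new) + sum(math.exp(x - m_new) for x in block)
        m = m_new
    return [math.exp(x - m) / s for x in xs]
```

The result matches an ordinary two-pass softmax, but each chunk is touched only once, which is what lets the real CUDA kernels keep everything in on-chip SRAM.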
How and why to build your PyTorch CUDA/C++ extension with a Makefile
Hackable and optimized Transformers building blocks, supporting a composable construction.
A high-throughput and memory-efficient inference and serving engine for LLMs
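The entry above describes vLLM, whose memory efficiency comes from PagedAttention: each sequence's KV cache is stored in fixed-size physical blocks drawn from a shared pool via a per-sequence block table, rather than in a contiguous max-length buffer. A toy sketch of that bookkeeping, assuming nothing about vLLM's actual classes (all names here are illustrative):

```python
class PagedKVCache:
    """Toy block-table allocator in the spirit of PagedAttention:
    logical token positions map to (physical block, offset) slots,
    and blocks are allocated on demand from a shared free pool."""

    def __init__(self, num_blocks, block_size):
        self.block_size = block_size
        self.free = list(range(num_blocks))  # shared pool of physical blocks
        self.tables = {}                     # seq_id -> list of block ids
        self.lengths = {}                    # seq_id -> tokens stored

    def append(self, seq_id):
        """Reserve a slot for one more token; return (block, offset)."""
        table = self.tables.setdefault(seq_id, [])
        n = self.lengths.get(seq_id, 0)
        if n % self.block_size == 0:         # current block is full (or none yet)
            table.append(self.free.pop())
        self.lengths[seq_id] = n + 1
        return table[n // self.block_size], n % self.block_size

    def release(self, seq_id):
        """Return a finished sequence's blocks to the shared pool."""
        self.free.extend(self.tables.pop(seq_id, []))
        self.lengths.pop(seq_id, None)
```

Because blocks are recycled the moment a request finishes, many concurrent sequences can share one GPU-sized pool with little internal fragmentation, which is where the high throughput comes from.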
Run a parallel command inside a split tmux window
Transformer-related optimizations, including BERT and GPT
Exploring the Design Space of Page Management for Multi-Tiered Memory Systems (USENIX ATC '21)
Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access (ACM EuroSys '23)
Official code repository for "CoVA: Exploiting Compressed-Domain Analysis to Accelerate Video Analytics [USENIX ATC 22]"
An open-source neumorphism (neomorphism) design framework
Node.js extension host for Vim & Neovim that loads extensions like VSCode and hosts language servers.