#
Lists (4)
Sort Name ascending (A-Z)
Starred repositories
6
results
for source starred repositories
written in Cuda
Clear filter
A massively parallel, optimal functional runtime in Rust
FlashInfer: Kernel Library for LLM Serving
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Reference implementation of Megalodon 7B model
llama3.cuda is a pure C/CUDA implementation for Llama 3 model.