JensenFire

JensenFire JensenFire

Focusing on interesting things

Achievements

vllm_flash_attn vllm_flash_attn Public

C++ 1
vllm vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python
aotriton aotriton Public

Forked from ROCm/aotriton

Ahead of Time (AOT) Triton Math Library

Python
CUDA-Learn-Notes CUDA-Learn-Notes Public

Forked from xlite-dev/LeetCUDA

📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda
Self-learning-Computer-Science Self-learning-Computer-Science Public

Forked from PKUFlyingPig/Self-learning-Computer-Science

the resources I use to learn computer science in my spare time
sglang sglang Public

Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Python