Skip to content
View tulvgengenr's full-sized avatar
😪
Sleeping always
😪
Sleeping always

Block or report tulvgengenr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

HPC

10 repositories

高性能计算相关知识学习笔记,包含学习笔记和相关知识的代码demo,在持续完善中。 如果有帮助的话请Star一下,对作者帮助很大,谢谢!

Jupyter Notebook 391 35 Updated Mar 28, 2023

📚150+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda 1,832 191 Updated Jan 1, 2025

This is a Chinese translation of the CUDA programming guide

1,381 218 Updated Nov 13, 2024

A CUDA tutorial to make people learn CUDA program from 0

Cuda 200 53 Updated Jul 9, 2024

Repository for HPCGame 1st Problems.

Go 56 7 Updated Feb 6, 2024

Material for gpu-mode lectures

Jupyter Notebook 3,334 340 Updated Dec 3, 2024

An ML Systems Onboarding list

592 19 Updated Nov 14, 2024

Pipeline Parallelism for PyTorch

Python 730 86 Updated Aug 21, 2024

纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行

C++ 3,360 348 Updated Dec 23, 2024

nnScaler: Compiling DNN models for Parallel Training

Python 82 13 Updated Dec 10, 2024