Skip to content
View smile2game's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report smile2game

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Code for D-DiT

Jupyter Notebook 23 3 Updated Apr 1, 2025

CVPR 2025 论文和开源项目合集

19,681 2,661 Updated Mar 11, 2025

A light llama-like llm inference framework based on the triton kernel.

Python 108 13 Updated Apr 18, 2025

Diffusion Transformers (DiTs) trained on MNIST dataset

Python 100 15 Updated Apr 4, 2024

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 3,056 190 Updated Oct 31, 2024

[ECCV 2024] Official pytorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts"

Python 40 7 Updated Jul 4, 2024

This repo implements Diffusion Transformers(DiT) in PyTorch and provides training and inference code on CelebHQ dataset

Python 28 6 Updated Jan 6, 2025

📄 Awesome CV is LaTeX template for your outstanding job application

TeX 24,225 4,928 Updated Feb 6, 2025

LLM/MLOps/LLMOps

HTML 84 16 Updated Sep 11, 2024

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 16,651 1,944 Updated Apr 13, 2025

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

Shell 55,576 11,950 Updated Apr 9, 2025

simplest online-softmax notebook for explain Flash Attention

Jupyter Notebook 9 Updated Dec 27, 2024

An Open-source Platform for Inverse Lithography Technology Research

Python 151 39 Updated Apr 16, 2025
Python 72 5 Updated May 4, 2021

tiny ring attention implement for learning purpose

Python 7 1 Updated Feb 14, 2024

auto sign cursor

Python 8,412 1,253 Updated Apr 12, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 26,189 2,516 Updated Mar 27, 2025

Adaptive Caching for Faster Video Generation with Diffusion Transformers

Python 145 6 Updated Nov 5, 2024

[NeurIPS'24 Spotlight, ICLR'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an …

Python 971 47 Updated Apr 16, 2025

Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without lossing end-to-end metrics across various models.

Cuda 1,338 92 Updated Apr 15, 2025

The best OSS video generation models

Python 3,101 337 Updated Jan 8, 2025

VideoSys: An easy and efficient system for video generation

Python 1,956 129 Updated Mar 9, 2025

Development repository for the Triton language and compiler

MLIR 15,280 1,935 Updated Apr 19, 2025
1 Updated Nov 28, 2022

五天刷题,三天模拟!快速掌握leetcode解题套路!

C++ 5 1 Updated Jun 5, 2022

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 475 38 Updated Apr 18, 2025

* this is a draft version; * this repo's commit history is omitted due to double-blind requirements for paper review, may be overwrite or deprecated later

Python 5 2 Updated Jun 20, 2024
Python 79 11 Updated Apr 15, 2025

Multithreaded matrix multiplication and analysis based on OpenMP and PThread

Cuda 147 36 Updated Nov 25, 2023
Next