fupiao1998

MaoYuxin fupiao1998

Student

32 followers · 27 following

https://orcid.org/0000-0002-9239-091X

Achievements

Stars

soham97 / mellow

small audio language model for reasoning

Python 50 1 Updated Mar 25, 2025

FoundationVision / UniTok

A Unified Tokenizer for Visual Generation and Understanding

Python 218 5 Updated Mar 3, 2025

ChrisDong-THU / GaussianToken

Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting

Python 69 2 Updated Feb 17, 2025

CIntellifusion / VideoDPO

Official Implementation of VideoDPO

Python 72 Updated Jan 12, 2025

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 9,342 1,014 Updated Mar 29, 2025

LTH14 / fractalgen

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 1,023 51 Updated Feb 25, 2025

youngyangyang04 / leetcode-master

《代码随想录》LeetCode 刷题攻略：200道经典题目刷题顺序，共60w字的详细图解，视频难点剖析，50余张思维导图，支持C++，Java，Python，Go，JavaScript等多语言版本，从此算法学习不再迷茫！🔥🔥 来看看，你会发现相见恨晚！🚀

Shell 55,147 11,917 Updated Mar 17, 2025

InfiMM / Awesome-Multimodal-LLM-for-Math-STEM

Paper collections of multi-modal LLM for Math/STEM/Code.

84 4 Updated Mar 30, 2025

Ola-Omni / Ola

Ola: Pushing the Frontiers of Omni-Modal Language Model

Python 321 14 Updated Feb 28, 2025

Deep-Agent / R1-V

Witness the aha moment of VLM with less than $3.

Python 3,432 271 Updated Mar 1, 2025

EvolvingLMMs-Lab / open-r1-multimodal

A fork to add multimodal model training to open-r1

Python 1,143 58 Updated Feb 8, 2025

MiniMax-AI / MiniMax-01

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

Python 2,434 179 Updated Mar 18, 2025

hao-ai-lab / FastVideo

FastVideo is a lightweight framework for accelerating large video diffusion models.

Python 1,284 76 Updated Mar 30, 2025

allenai / molmo

Code for the Molmo Vision-Language Model

Python 347 26 Updated Dec 12, 2024

modelscope / facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Jupyter Notebook 9,358 875 Updated Dec 10, 2024

baaivision / NOVA

[ICLR 2025] Autoregressive Video Generation without Vector Quantization

Python 440 12 Updated Mar 27, 2025

min-hieu / Tutorial_4

A Tutorial for Diffusion Models

Jupyter Notebook 45 5 Updated Jul 17, 2023

daixiangzi / Awesome-Token-Compress

A paper list of some recent works about Token Compress for Vit and VLM

394 20 Updated Mar 27, 2025

ByteFlow-AI / TokenFlow

[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

Python 296 1 Updated Mar 5, 2025

naver / croco

Python 389 44 Updated Jul 30, 2024

tal-tech / chinese-k12-evaluation

Python 22 2 Updated Mar 21, 2024

opendatalab / MinerU

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具，将PDF转换成Markdown和JSON格式。

Python 29,352 2,319 Updated Mar 27, 2025

OpenMOSS / GAOKAO-MM

[ACL'2024 Findings] GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation

Python 55 5 Updated Mar 13, 2024

huangwb8 / ChineseResearchLaTeX

中国科研常用LaTeX模板集

TeX 495 68 Updated Mar 11, 2025

uncbiag / LiVOS

LiVOS: Light Video Object Segmentation with Gated Linear Matching (CVPR 2025)

Python 28 2 Updated Mar 10, 2025

NVIDIA / Cosmos-Tokenizer

A suite of image and video neural tokenizers

Jupyter Notebook 1,589 74 Updated Feb 11, 2025

JishengBai / AudioSetCaps

A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline

Python 123 2 Updated Dec 13, 2024

PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

Python 3,136 395 Updated Mar 30, 2025

Vision-CAIR / LongVU

Python 366 27 Updated Feb 28, 2025

rhymes-ai / Allegro

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 1,065 62 Updated Feb 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MaoYuxin fupiao1998

Achievements

Achievements

Block or report fupiao1998

Stars

soham97 / mellow

FoundationVision / UniTok

ChrisDong-THU / GaussianToken

CIntellifusion / VideoDPO

Wan-Video / Wan2.1

LTH14 / fractalgen

youngyangyang04 / leetcode-master

InfiMM / Awesome-Multimodal-LLM-for-Math-STEM

Ola-Omni / Ola

Deep-Agent / R1-V

EvolvingLMMs-Lab / open-r1-multimodal

MiniMax-AI / MiniMax-01

hao-ai-lab / FastVideo

allenai / molmo

modelscope / facechain

baaivision / NOVA

min-hieu / Tutorial_4

daixiangzi / Awesome-Token-Compress

ByteFlow-AI / TokenFlow

naver / croco

tal-tech / chinese-k12-evaluation

opendatalab / MinerU

OpenMOSS / GAOKAO-MM

huangwb8 / ChineseResearchLaTeX

uncbiag / LiVOS

NVIDIA / Cosmos-Tokenizer

JishengBai / AudioSetCaps

PKU-Alignment / align-anything

Vision-CAIR / LongVU

rhymes-ai / Allegro