-
UCAS
- Beijing
-
04:08
(UTC +08:00) - ddz16.github.io
- https://www.zhihu.com/people/ddz-73
Stars
A Simple Framework of Small-scale LMMs for Video Understanding
[ICCV-2025] Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
Survey: https://arxiv.org/pdf/2507.20198
π₯π₯π₯ [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.
Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models
Writing AI Conference Papers: A Handbook for Beginners
Diffusion Convolutional Recurrent Neural Network Implementation in PyTorch
Implementation of Diffusion Convolutional Recurrent Neural Network in Tensorflow
π¬ A Researcher-Friendly Framework for Time Series Analysis. Train Any Model on Any Dataset!
[TPAMI 2025 & ICML 2024 Oral] Official repository of the SparseTSF paper: "SparseTSF: Modeling Long-term Time Series Forecasting with 1k Parameters". This work is developed by the Lab of Professor β¦
Implementations, Pre-training Code and Datasets of Large Time-Series Models
The paper, dataset and code lists of underwater image enhancement
[IEEE TPAMI] A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends
High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution Image Synthesis with Latent Diffusion Models
A latent text-to-image diffusion model
π€ Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
A collection of awesome text-to-image generation studies.
A Flexible and Unified Image Restoration Framework (PyTorch), including state-of-the-art image restoration model. Such as NAFNet, Restormer, MPRNet, MIMO-UNet, SCUNet, SwinIR, HINet, etc. ββββββ
The official codebase of ECCV2024 paper: PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines.
A benchmark for the next generation of data-driven global weather models.
LaTex Template for my PhD Application, including CV and RP...
AcadHomepage: A Modern and Responsive Academic Personal Homepage
Pytorch implementation of the TUT model from the 2023 ICME paper: Do we really need temporal convolutions in action segmentation?
Effortless AI-assisted data labeling with AI support from YOLO, Segment Anything (SAM+SAM2), MobileSAM!!
MATLAB code for color restoration of underwater images