-
Hefei University of Technology
- Hefei, China
- https://scholar.google.com/citations?hl=zh-CN&user=Xw0l6x8AAAAJ
- https://orcid.org/0000-0003-3234-963X
Lists (3)
Sort Name ascending (A-Z)
Stars
A linear estimator on top of clip to predict the aesthetic quality of pictures
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
An open source implementation of CLIP.
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
Official implementation of ImprovingText-guided ObjectInpainting with SemanticPre-inpainting in ECCV 2024
Extended LaTeX template for CVPR/ICCV papers
The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"
Runway Inpainting based on Stable Diffusion
HuggingFace diffusers' pipeline to run ZestGuide
Code of the paper "FBSDiff: Plug-and-Play Frequency Band Substitution of Diffusion Features for Highly Controllable Text-Driven Image Translation" accepted by ACMMM 2024
[ECCV 2024] FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
[CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias
Official implementation of FouriScale (ECCV2024)
Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
Implementation of Autoregressive Diffusion in Pytorch
Official Open Source code for "Scaling Language-Image Pre-training via Masking"
PyTorch implementation for the paper Don't Look into the Dark: Latent Codes for Pluralistic Image Inpainting (CVPR2024).
ASCII generator (image to text, image to image, video to video)
NÜWA-LIP: Language Guided Image Inpainting with Defect-free VQGAN
Official implementation of DAFT-GAN: Dual Affine Transformation Generative Adversarial Network for Text-Guided Image Inpainting (ACM MM 2024)