Skip to content
View lhd777's full-sized avatar
  • Tsinghua University
  • Beijing

Block or report lhd777

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Taming Stable Diffusion for Lip Sync!

Python 1,761 194 Updated Jan 15, 2025

Unofficial PyTorch implementation of Denoising Diffusion Probabilistic Models

Python 533 63 Updated Jun 11, 2024

Memory-Guided Diffusion for Expressive Talking Video Generation

Python 659 62 Updated Dec 16, 2024

Out of time: automated lip sync in the wild

Python 705 158 Updated Jan 23, 2024
Python 194 9 Updated Jul 23, 2024

Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System

Python 2,606 247 Updated Jan 4, 2025

[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model…

Python 744 47 Updated Sep 8, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 85,857 23,118 Updated Jan 15, 2025

Select a portrait, click to move the head around (please use your own space / GPU!)

JavaScript 794 78 Updated Nov 21, 2024

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 4,356 376 Updated Dec 22, 2024

🎓 Update Talking-Face Research Papers Daily, Now Integrated with LLM Analysis.

Python 182 16 Updated Jan 15, 2025
Python 133 18 Updated Jul 12, 2023

MMSA is a unified framework for Multimodal Sentiment Analysis.

Python 723 113 Updated Dec 12, 2024

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,204 148 Updated Sep 3, 2024

The official implementation of the ECCV 2024 paper: Continuity Preserving Online CenterLine Graph Learning

Python 27 3 Updated Dec 16, 2024

ICML'2024 | MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI

Python 96 3 Updated Jul 18, 2024

[ECCV 2024] Sparse Beats Dense: Rethinking Supervision in Radar-Camera Depth Completion

Python 24 1 Updated Oct 9, 2024

[ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"

Python 583 30 Updated Aug 16, 2024

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 7,205 550 Updated Jul 17, 2024

The official PyTorch implementation of L2CS-Net for gaze estimation and tracking

Python 353 84 Updated Feb 2, 2024

[EMNLP 2024] RWKV-CLIP: A Robust Vision-Language Representation Learner

Python 121 8 Updated Nov 16, 2024

the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"

Python 357 67 Updated May 12, 2024

Official implementation of ID-unaware Deepfake Detection Model

C++ 158 21 Updated Aug 15, 2023

Unofficial Implementation of ReplaceAnything: https://aigcdesigngroup.github.io/replace-anything/

Python 535 89 Updated May 27, 2024
Python 872 123 Updated Dec 11, 2024
JavaScript 2,804 1,010 Updated Jun 21, 2024

[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!

Jupyter Notebook 793 44 Updated Dec 1, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,095 2,319 Updated Aug 12, 2024

AIGC资料汇总学习,持续更新......

787 93 Updated Oct 22, 2023
Next