yuananf
  • JD
  • BeiJing

WiLoR hand 3D pose estimation! Simplifying WiLoR into a Python package!

Python 23 2 Updated Nov 2, 2024

The official implementation of RealisDance

C 275 15 Updated Nov 14, 2024

High performance self-hosted photo and video management solution.

TypeScript 54,977 2,950 Updated Dec 27, 2024

Geometric Computer Vision Library for Spatial AI

Python 10,101 978 Updated Dec 25, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 13,325 1,275 Updated Dec 25, 2024

Bring portraits to life in real time! ONNX/TensorRT support! Real-time portrait animation!

Python 565 53 Updated Dec 22, 2024

TensorRT plugin for the 3D grid sample operator

C++ 20 3 Updated Jan 5, 2024

EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 3,305 385 Updated Dec 10, 2024

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Python 2,046 172 Updated Sep 23, 2024

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 9,640 1,319 Updated Sep 14, 2024

Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".

Python 1,048 60 Updated Jul 23, 2024

Inference and training library for high-quality TTS models.

Python 4,821 496 Updated Dec 10, 2024

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Python 1,462 153 Updated Dec 2, 2024

[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Python 2,335 152 Updated Oct 21, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 22,882 2,251 Updated Dec 27, 2024

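🚀 "Douyin_TikTok_Download_API" is an out-of-the-box, high-performance asynchronous data-scraping tool for Douyin, Kuaishou, TikTok, and Bilibili, supporting API calls and online batch parsing and downloading.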

Python 9,917 1,511 Updated Dec 23, 2024

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Python 11,811 1,037 Updated Dec 27, 2024

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

Python 4,485 393 Updated Jul 30, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 37,387 4,247 Updated Dec 19, 2024

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Python 1,855 132 Updated Nov 27, 2024

PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT

Python 2,627 355 Updated Dec 26, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,038 1,083 Updated Nov 14, 2024

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Python 16,136 3,400 Updated Oct 9, 2024

Minimalist ML framework for Rust

Rust 16,141 990 Updated Dec 24, 2024

StableLM: Stability AI Language Models

Jupyter Notebook 15,833 1,035 Updated Apr 8, 2024

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 169,819 44,672 Updated Dec 27, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 15,462 1,422 Updated Sep 5, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 48,287 5,707 Updated Sep 18, 2024

Zero-shot Image-to-Image Translation [SIGGRAPH 2023]

Python 1,083 79 Updated Oct 16, 2024

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 13,410 1,860 Updated Nov 19, 2024