Skip to content
View xinntao's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@TencentARC @XPixelGroup

Block or report xinntao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Improving Video Generation with Human Feedback

Python 95 Updated Feb 12, 2025

[ARXIV'25] GameFactory: Creating New Games with Generative Interactive Videos

Python 260 9 Updated Jan 15, 2025

[ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation

Jupyter Notebook 298 14 Updated Feb 7, 2025

[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Python 490 14 Updated Dec 11, 2024

[ARXIV'24] StyleMaster: Stylize Your Video with Artistic Generation and Translation

84 Updated Dec 11, 2024

Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".

Python 152 4 Updated Nov 8, 2024

Excalidraw app for mac. Powered by pure SwiftUI.

Swift 348 21 Updated Feb 13, 2025

Let your Claude able to think

TypeScript 14,327 1,675 Updated Jan 23, 2025

Deep Reinforcement Learning

3,557 606 Updated Dec 10, 2022

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,730 124 Updated Dec 6, 2024

Next-Token Prediction is All You Need

Python 2,001 78 Updated Oct 24, 2024

Kolors Team

Python 4,186 315 Updated Nov 13, 2024

Bring portraits to life!

Python 14,056 1,512 Updated Feb 13, 2025

SEED-Voken: A Series of Powerful Visual Tokenizers

Python 830 32 Updated Feb 14, 2025

A PyTorch native library for large model training

Python 3,320 275 Updated Feb 18, 2025

Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"

Python 405 11 Updated Sep 2, 2024
Python 355 15 Updated Oct 21, 2024

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Python 3,634 391 Updated Jan 3, 2025

Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Support 20+ translate services.

TypeScript 8,180 379 Updated Feb 17, 2025

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Python 1,487 121 Updated Dec 17, 2024

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫

Python 19,935 5,936 Updated Feb 12, 2025

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Python 572 24 Updated Oct 25, 2024

Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)

Jupyter Notebook 895 71 Updated Nov 7, 2023

A simple HTML visualization tool for computer vision research 🛠️

Python 242 15 Updated Feb 13, 2025

Transparent Image Layer Diffusion using Latent Transparency

2,071 30 Updated Jun 16, 2024

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,774 225 Updated Sep 8, 2024

ICLR 2024 (Spotlight)

Python 746 20 Updated Mar 2, 2024

PhotoMaker [CVPR 2024]

Jupyter Notebook 9,785 775 Updated Oct 31, 2024

Official Code for MotionCtrl [SIGGRAPH 2024]

Python 1,391 75 Updated Sep 20, 2024
Next