Skip to content
View xinntao's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@TencentARC @XPixelGroup

Block or report xinntao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Deep Reinforcement Learning

3,341 588 Updated Dec 10, 2022

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,522 100 Updated Nov 11, 2024

Next-Token Prediction is All You Need

Python 1,805 70 Updated Oct 24, 2024

Kolors Team

Python 3,835 264 Updated Sep 4, 2024

Bring portraits to life!

Python 12,906 1,369 Updated Nov 12, 2024

Open-MAGVIT2: Democratizing Autoregressive Visual Generation

Python 693 28 Updated Sep 27, 2024

A native PyTorch Library for large model training

Python 2,594 204 Updated Nov 5, 2024

Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"

Python 367 9 Updated Sep 2, 2024
Python 344 14 Updated Oct 21, 2024

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Python 3,319 354 Updated Nov 11, 2024

Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Support 20+ translate services.

TypeScript 7,472 350 Updated Oct 26, 2024

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Python 1,428 119 Updated Jul 17, 2024

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫

Python 17,684 5,519 Updated Nov 8, 2024

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Python 524 19 Updated Oct 25, 2024

Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)

Jupyter Notebook 869 72 Updated Nov 7, 2023

A simple HTML visualization tool for computer vision research 🛠️

Python 236 14 Updated Oct 28, 2024

Transparent Image Layer Diffusion using Latent Transparency

2,017 26 Updated Jun 16, 2024

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,577 206 Updated Sep 8, 2024

ICLR 2024 (Spotlight)

Python 724 22 Updated Mar 2, 2024

PhotoMaker [CVPR 2024]

Jupyter Notebook 9,539 767 Updated Oct 31, 2024

Official Code for MotionCtrl [SIGGRAPH 2024]

Python 1,321 70 Updated Sep 20, 2024

Official code of SmartEdit [CVPR-2024 Highlight]

Python 250 8 Updated Jun 21, 2024

Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

351 13 Updated Mar 29, 2024

Easily create large video dataset from video urls

Python 545 65 Updated Jul 30, 2024

A lightweight tool for camera pose visualization

Python 97 6 Updated Sep 19, 2024

(CVPR2024)A benchmark for evaluating Multimodal LLMs using multiple-choice questions.

Python 315 12 Updated Jul 11, 2024

Official implementation of SEED-LLaMA (ICLR 2024).

Python 575 31 Updated Sep 21, 2024

Official codes for DeSRA (ICML 2023)

Python 126 Updated Feb 2, 2024

Implementation of “DreamDiffusion: Generating High-Quality Images from Brain EEG Signals”

Python 474 58 Updated Jan 30, 2024

NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models

Python 395 19 Updated May 14, 2024
Next