Chenxi chenxwh

412 followers · 3 following

OmniGen Public
Forked from VectorSpaceLab/OmniGen

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook MIT License Updated Nov 4, 2024
OmniParser Public
Forked from microsoft/OmniParser

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook Creative Commons Attribution 4.0 International Updated Nov 1, 2024
Meissonic Public
Forked from viiika/Meissonic

Inference and Training Code of Meissonic

Python Apache License 2.0 Updated Oct 20, 2024
chenxwh.github.io Public
Forked from alshedivat/al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

JavaScript 1 MIT License Updated Oct 20, 2024
hart Public
Forked from mit-han-lab/hart

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Python MIT License Updated Oct 19, 2024
Emu3 Public
Forked from baaivision/Emu3

Next-Token Prediction is All You Need

Python Apache License 2.0 Updated Oct 18, 2024
CogView3 Public
Forked from THUDM/CogView3

Text-to-image generation: CogView3-Plus and CogView3 (ECCV 2024)

Python Apache License 2.0 Updated Oct 14, 2024
t2v-turbo Public
Forked from Ji4chenLi/t2v-turbo

Code repository for T2V-Turbo

Python 1 1 Updated Oct 14, 2024
PMRF Public
Forked from ohayonguy/PMRF

Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration

Python MIT License Updated Oct 12, 2024
ml-depth-pro Public
Forked from apple/ml-depth-pro

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 2 Other Updated Oct 12, 2024
Lotus Public
Forked from EnVision-Research/Lotus

Official Implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

Python 4 Apache License 2.0 Updated Oct 7, 2024
UnSAM Public
Forked from frank-xwang/UnSAM

[NeurIPS 2024] Code release for "Segment Anything without Supervision"

Jupyter Notebook Updated Oct 6, 2024
DepthCrafter Public
Forked from Tencent/DepthCrafter

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Python Other Updated Oct 1, 2024
Upscale-A-Video Public
Forked from sczhou/Upscale-A-Video

[CVPR 2024] Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution

Python Other Updated Sep 27, 2024
CogVLM2 Public
Forked from THUDM/CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Python Apache License 2.0 Updated Sep 25, 2024
CogVideo Public
Forked from THUDM/CogVideo

Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 2 Apache License 2.0 Updated Sep 25, 2024
LLaMA-Omni Public
Forked from ictnlp/LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 1 Apache License 2.0 Updated Sep 22, 2024
DiffSynth-Studio Public
Forked from modelscope/DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python 1 Apache License 2.0 Updated Jul 1, 2024
Depth-Anything-V2 Public
Forked from DepthAnything/Depth-Anything-V2

Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 1 Apache License 2.0 Updated Jun 30, 2024
Omost Public
Forked from lllyasviel/Omost

Your image is almost there!

Python 5 2 Apache License 2.0 Updated Jun 3, 2024
SadTalker Public
Forked from OpenTalker/SadTalker

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 26 15 Other Updated Jun 1, 2024
HunyuanDiT Public
Forked from Tencent/HunyuanDiT

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Python 1 Other Updated May 24, 2024
StoryDiffusion Public
Forked from HVision-NKU/StoryDiffusion

Create Magic Story!

Jupyter Notebook 1 Updated May 4, 2024
OpenVoice Public
Forked from myshell-ai/OpenVoice

Instant voice cloning by MyShell.

Python 24 6 MIT License Updated Apr 28, 2024
PixArt-sigma Public
Forked from PixArt-alpha/PixArt-sigma

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 3 GNU Affero General Public License v3.0 Updated Apr 13, 2024
Kandinsky-2 Public
Forked from ai-forever/Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

Jupyter Notebook 88 37 Apache License 2.0 Updated Apr 12, 2024
AniPortrait Public
Forked from Zejun-Yang/AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python 5 Apache License 2.0 Updated Apr 1, 2024
Smooth-Diffusion Public
Forked from SHI-Labs/Smooth-Diffusion

[CVPR 2024] Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models

Python 1 MIT License Updated Mar 22, 2024
cog-c4ai Public

Python Updated Mar 19, 2024
AVeriTeC Public
Forked from MichSchli/AVeriTeC

Python Updated Mar 17, 2024
