chenxwh

Organizations

@replicate

  • OmniGen Public

    Forked from VectorSpaceLab/OmniGen

    OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

    Jupyter Notebook MIT License Updated Nov 4, 2024
  • OmniParser Public

    Forked from microsoft/OmniParser

    A simple screen-parsing tool toward a pure vision-based GUI agent

    Jupyter Notebook Creative Commons Attribution 4.0 International Updated Nov 1, 2024
  • Meissonic Public

    Forked from viiika/Meissonic

    Inference and Training Code of Meissonic

    Python Apache License 2.0 Updated Oct 20, 2024
  • A beautiful, simple, clean, and responsive Jekyll theme for academics

    JavaScript 1 MIT License Updated Oct 20, 2024
  • hart Public

    Forked from mit-han-lab/hart

    HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

    Python MIT License Updated Oct 19, 2024
  • Emu3 Public

    Forked from baaivision/Emu3

    Next-Token Prediction is All You Need

    Python Apache License 2.0 Updated Oct 18, 2024
  • CogView3 Public

    Forked from THUDM/CogView3

    Text-to-image generation: CogView3-Plus and CogView3 (ECCV 2024)

    Python Apache License 2.0 Updated Oct 14, 2024
  • t2v-turbo Public

    Forked from Ji4chenLi/t2v-turbo

    Code repository for T2V-Turbo

    Python 1 1 Updated Oct 14, 2024
  • PMRF Public

    Forked from ohayonguy/PMRF

    Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration

    Python MIT License Updated Oct 12, 2024
  • ml-depth-pro Public

    Forked from apple/ml-depth-pro

    Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

    Python 2 Other Updated Oct 12, 2024
  • Lotus Public

    Forked from EnVision-Research/Lotus

    Official Implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

    Python 4 Apache License 2.0 Updated Oct 7, 2024
  • UnSAM Public

    Forked from frank-xwang/UnSAM

    [NeurIPS 2024] Code release for "Segment Anything without Supervision"

    Jupyter Notebook Updated Oct 6, 2024
  • DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

    Python Other Updated Oct 1, 2024
  • [CVPR 2024] Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution

    Python Other Updated Sep 27, 2024
  • CogVLM2 Public

    Forked from THUDM/CogVLM2

    GPT4V-level open-source multi-modal model based on Llama3-8B

    Python Apache License 2.0 Updated Sep 25, 2024
  • CogVideo Public

    Forked from THUDM/CogVideo

    Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

    Python 2 Apache License 2.0 Updated Sep 25, 2024
  • LLaMA-Omni Public

    Forked from ictnlp/LLaMA-Omni

    LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

    Python 1 Apache License 2.0 Updated Sep 22, 2024
  • Enjoy the magic of Diffusion models!

    Python 1 Apache License 2.0 Updated Jul 1, 2024
  • Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

    Python 1 Apache License 2.0 Updated Jun 30, 2024
  • Omost Public

    Forked from lllyasviel/Omost

    Your image is almost there!

    Python 5 2 Apache License 2.0 Updated Jun 3, 2024
  • SadTalker Public

    Forked from OpenTalker/SadTalker

    (CVPR 2023) SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

    Python 26 15 Other Updated Jun 1, 2024
  • HunyuanDiT Public

    Forked from Tencent/HunyuanDiT

    Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

    Python 1 Other Updated May 24, 2024
  • Create Magic Story!

    Jupyter Notebook 1 Updated May 4, 2024
  • OpenVoice Public

    Forked from myshell-ai/OpenVoice

    Instant voice cloning by MyShell.

    Python 24 6 MIT License Updated Apr 28, 2024
  • PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

    Python 3 GNU Affero General Public License v3.0 Updated Apr 13, 2024
  • Kandinsky 2 — multilingual text2image latent diffusion model

    Jupyter Notebook 88 37 Apache License 2.0 Updated Apr 12, 2024
  • AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

    Python 5 Apache License 2.0 Updated Apr 1, 2024
  • [CVPR 2024] Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models

    Python 1 MIT License Updated Mar 22, 2024
  • cog-c4ai Public

    Python Updated Mar 19, 2024
  • AVeriTeC Public

    Forked from MichSchli/AVeriTeC

    Python Updated Mar 17, 2024