Skip to content
View caoandong's full-sized avatar

Block or report caoandong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[Arxiv 2024] Edicho: Consistent Image Editing in the Wild

24 Updated Dec 31, 2024

Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without lossing end-to-end metrics across various models.

Cuda 775 42 Updated Dec 28, 2024

State Management and Multiplayer Networking for Turn-Based Games

TypeScript 10,868 715 Updated Dec 30, 2024

Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion

Python 28 Updated Dec 20, 2024

High performance UI layout library in C.

C 8,940 289 Updated Dec 31, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 20,850 1,605 Updated Dec 31, 2024

Official repo for "IDArb: Intrinsic Decomposition for arbitrary number of input views and illuminations"

Python 29 Updated Dec 17, 2024

Learning Flow Fields in Attention for Controllable Person Image Generation

Python 790 78 Updated Dec 20, 2024

EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM

Jupyter Notebook 38 Updated Dec 27, 2024

Official code for Neural LightRig.

97 1 Updated Dec 13, 2024

[ARXIV'24] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Python 336 12 Updated Dec 11, 2024

You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale

Python 580 14 Updated Dec 21, 2024
Jupyter Notebook 32 Updated Dec 5, 2024

SeekStorm - sub-millisecond full-text search library & multi-tenancy server in Rust

Rust 1,473 39 Updated Dec 23, 2024

Template repo with the latest tech working together

TypeScript 99 7 Updated Jan 1, 2025

The official implementation of paper: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting

Python 106 5 Updated Dec 11, 2024

Official implementation of OneDiffusion paper

Python 554 19 Updated Dec 14, 2024

Official repository for LTX-Video

Python 2,269 169 Updated Dec 20, 2024

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python 6,212 376 Updated Dec 27, 2024

A minimal and universal controller for FLUX.1.

Python 1,016 62 Updated Dec 30, 2024

Subjects200K dataset

Jupyter Notebook 88 3 Updated Dec 24, 2024

Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation

111 4 Updated Nov 26, 2024

A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and fully reproducible.

603 32 Updated Nov 29, 2024

Muon optimizer for neural networks: >30% extra sample efficiency, <3% wallclock overhead

Python 201 6 Updated Dec 15, 2024

NanoGPT (124M) in 3.6 minutes

Python 1,990 189 Updated Jan 1, 2025

ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements

Python 17 1 Updated Dec 1, 2024

Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥

Python 507 22 Updated Dec 27, 2024

[Preprint] Number it: Temporal Grounding Videos like Flipping Manga

Python 48 1 Updated Nov 29, 2024

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Python 1,114 66 Updated Dec 7, 2024

Unifying 3D Mesh Generation with Language Models

Python 833 41 Updated Dec 5, 2024
Next