- CVL, ETH Zurich
- Zurich, Switzerland
- https://ha0tang.github.io/
- @HaoTang_ai
Stars
3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding
Semantic-Guided Diffusion Model for Single-Step Image Super-Resolution
[CVPR'25] Official Implementation of MambaIC: State Space Models for High-Performance Learned Image Compression
Official repo of "Barbie: Text to Barbie-Style 3D Avatars"
[PR 2024] GraphMLP: A Graph MLP-Like Architecture for 3D Human Pose Estimation
Efficient and Online Dataset Growth Algorithm (with cleanness and diversity awareness) to deal with growing web data
The repository is dedicated to tracking the latest advances in the field of Physical Adversarial Attack (PAA).
[ECCV2024] StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion
PyTorch implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation" (CVPR 2024)
🔥 [ECCV 2024] Motion Mamba: Efficient and Long Sequence Motion Generation
SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
https://camera-agnostic.github.io/
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters.
[NeurIPS 2024] Official implementation of "Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models"
[ICCV2023] Dataset Quantization
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
InterFormer: a method that generates a reaction motion from an action motion using 3D skeletons
PyTorch code for 'LANet: Local Attention Embedding to Improve the Semantic Segmentation of Remote Sensing Images'
[CVPR2023] A faster, smaller, and better text-to-image model for large-scale training
Official PyTorch implementation of our AAAI 2023 paper "HOTCOLD Block: Fooling Thermal Infrared Detectors with a Novel Wearable Design"
[AAAI 2023 Oral] Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training
[AAAI 2023] Dynamic Text-guided Image Editing Adversarial Networks
SiNeRF: Sinusoidal Neural Radiance Fields for Joint Pose Estimation and Scene Reconstruction