Efficient vision foundation models for high-resolution generation and perception.
imagenet segmentation high-resolution vision-transformer efficientvit segment-anything deep-compression-autoencoder efficient-diffusion-model
-
Updated
Jan 24, 2025 - Python