Efficient vision foundation models for high-resolution generation and perception.
imagenet
segmentation
high-resolution
vision-transformer
efficientvit
segment-anything
deep-compression-autoencoder
efficient-diffusion-model
-
Updated
Nov 12, 2024 - Python