Skip to content

Latest commit

 

History

History
3 lines (2 loc) · 163 Bytes

230511 Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers.md

File metadata and controls

3 lines (2 loc) · 163 Bytes

https://arxiv.org/abs/2305.07011

Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers (Dahun Kim, Anelia Angelova, Weicheng Kuo)