https://arxiv.org/abs/2306.17165
An Efficient General-Purpose Modular Vision Model via Multi-Task Heterogeneous Training (Zitian Chen, Mingyu Ding, Yikang Shen, Wei Zhan, Masayoshi Tomizuka, Erik Learned-Miller, Chuang Gan)
https://arxiv.org/abs/2306.17165
An Efficient General-Purpose Modular Vision Model via Multi-Task Heterogeneous Training (Zitian Chen, Mingyu Ding, Yikang Shen, Wei Zhan, Masayoshi Tomizuka, Erik Learned-Miller, Chuang Gan)