https://arxiv.org/abs/2306.06687
LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark (Zhenfei Yin, Jiong Wang, Jianjian Cao, Zhelun Shi, Dingning Liu, Mukai Li, Lu Sheng, Lei Bai, Xiaoshui Huang, Zhiyong Wang, Wanli Ouyang, Jing Shao)