https://arxiv.org/abs/2301.11325
MusicLM: Generating Music From Text (Andrea Agostinelli, Timo I. Denk, Zalán Borsos, Jesse Engel, Mauro Verzetti, Antoine Caillon, Qingqing Huang, Aren Jansen, Adam Roberts, Marco Tagliasacchi, Matt Sharifi, Neil Zeghidour, Christian Frank)
https://google-research.github.io/seanet/musiclm/examples/
text2music. As for how good the results actually are, I think it is best to point to the following tweet thread.
https://twitter.com/keunwoochoi/status/1618809167573286912
#audio_generation