YuanGongND

Follow

Yuan Gong YuanGongND

Follow

Research Scientist, MIT CSAIL

392 followers · 2 following

MIT
Cambridge, MA
11:37 (UTC -05:00)
yuangongnd.github.io

Achievements

Achievements

Pinned Loading

ltu ltu Public

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

Python 401 38
whisper-at whisper-at Public

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

Python 346 28
gopt gopt Public

Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".

Python 158 29
cav-mae cav-mae Public

Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".

Python 244 23
ssast ssast Public

Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

Python 369 60
ast ast Public

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1.2k 220