Skip to content
View YuanGongND's full-sized avatar

Block or report YuanGongND

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. ltu ltu Public

    Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

    Python 385 36

  2. whisper-at whisper-at Public

    Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

    Python 321 27

  3. gopt gopt Public

    Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".

    Python 150 27

  4. cav-mae cav-mae Public

    Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".

    Python 233 23

  5. ssast ssast Public

    Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

    Python 364 61

  6. ast ast Public

    Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

    Jupyter Notebook 1.2k 218