This repository is the official implementation of our DCASE 2021 Task 1a submission, described in the technical report:
Soonshin Seo, Ji-Hwan Kim: "MobileNet using Coordinate Attention and Fusions for Low-Complexity Acoustic Scene Classification with Multiple Devices", submitted to task 1a of the 2021 DCASE Challenge
The technical report is available at http://dcase.community/documents/challenge2021/technical_reports/DCASE2021_Seo_52_t1.pdf
- Download the development dataset from https://zenodo.org/record/3819968#.YLiqhfkzaUk
- Use the script "feats.py" & data augmentation scripts
- Use the script "train.py"
- Use the script "run.sh"
- Normalization & data augmentations
- MobileNet
- Coordinate attention
- Early fusion & late fusion
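
As a rough illustration of the late fusion item above, the sketch below averages the per-class probabilities produced by several models and picks the highest-scoring class. This is only an assumption about how late fusion could be done; the helper names `late_fusion` and `predict`, the optional weights, and the example scores are hypothetical and are not taken from this repository's scripts.

```python
def late_fusion(prob_sets, weights=None):
    """Weighted average of per-class probabilities (hypothetical helper).

    prob_sets: one list of class probabilities per model.
    weights:   optional per-model weights; equal weights if omitted.
    """
    if not prob_sets:
        raise ValueError("need at least one model output")
    n_classes = len(prob_sets[0])
    if any(len(p) != n_classes for p in prob_sets):
        raise ValueError("all models must score the same classes")
    if weights is None:
        weights = [1.0] * len(prob_sets)
    if len(weights) != len(prob_sets):
        raise ValueError("need exactly one weight per model")
    total = sum(weights)
    # Fuse class by class: weighted sum across models, then normalize.
    return [
        sum(w * p[c] for w, p in zip(weights, prob_sets)) / total
        for c in range(n_classes)
    ]


def predict(fused):
    """Return the index of the class with the highest fused score."""
    return max(range(len(fused)), key=fused.__getitem__)


# Made-up scores for three scene classes from two models.
model_a = [0.6, 0.3, 0.1]
model_b = [0.1, 0.7, 0.2]
fused = late_fusion([model_a, model_b])
label = predict(fused)
```

Early fusion, by contrast, would combine the inputs or intermediate features before a single classifier rather than combining the outputs of separate models.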
We used the implementation presented in https://github.com/MihawkHu/DCASE2020_task1 as our baseline script.
@techreport{Seo_DCASE2021,
author = {Soonshin Seo and Ji-Hwan Kim},
title = {MobileNet using Coordinate Attention and Fusions for Low-Complexity Acoustic Scene Classification with Multiple Devices},
institution = {DCASE2021 Challenge},
year = {2021},
}