Cornell Birdcall Identification competition

This is the code for Cornell Birdcall Identification challenge hosted on Kaggle

Data

Librosa library is pretty slow for reading and transforming audio. So, I read data using librosa and saved it as HDF5 file. More about that you can read here.

Script for transforming .mp3 to hdf5: create/read_and_transform_audio.py

Augmentations

Augmentations are useful for better models generalization. I've used albumentations library and this Kaggle notebook to build augmentations for spectrograms transforming.

Code for this part tou can find here: modules/data/augmentations

Model

I've used CNN for image classification for this task. Family of EfficientNet models is the SOTA for image classification now, so I chose it. Also I've used PyTorch Lightning to build training pipeline.

Model part: modules/model

Built With

PyTorch - Neural networks framework used
PyTorch Lightning - For training pipeline
Albumentations - Fot spectrogram augmentaions

Authors

Vadim Titko aka Vadbeg - GitHub | LinkedIn

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
create		create
modules		modules
.gitignore		.gitignore
README.md		README.md
evaluation.py		evaluation.py
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Cornell Birdcall Identification competition

Data

Augmentations

Model

Built With

Authors

About

Releases

Packages

Languages

Vadbeg/Birds

Folders and files

Latest commit

History

Repository files navigation

Cornell Birdcall Identification competition

Data

Augmentations

Model

Built With

Authors

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages