Bird Call Classification

The repo contains various materials created as a part of Cornell Birdcall Identification project (https://www.kaggle.com/c/birdsong-recognition/)

Files and Folders

Python Scripts

Birds Recognition Feature Selection.py - the script to implement the parallelized audio feature extraction flow using multiprocessing lib
Birds Recognition Feature Selection_single_process.py - the script to implement the single-process
dask_mp_feature_extraction.py - the script implementing the parallelized audio feature extraction using Dask
ray_mp_feature_extraction.py - the script implementing the parallelized audio feature extraction using Ray
utils.py - the module with various audio feature extraction utility functions
config.py - the configuration module of the project solution
lightgbm_training.py - the script with the experiments to train lightgbm model to do the multi-class classification for bird calls
log_regression_training.py - the script with the experiments to train logistic regression model to do the multi-class classification for bird calls
multiple_models_training.py - the script with the experiments to train 4 different models (random forest, Multinomial Naive Bayessian Classifier, Linear SVM Classifier, and logistic regression) to do the multi-class classification for bird calls

Jupyter Notebooks

Birds Calling EDA and Feature Importance.ipynb - the notebook to do EDA for the combined training set with the original tabular data and audio features extracted from the audio files, using Librosa

Data

data - the folder to download the competition data as published in https://www.kaggle.com/c/birdsong-recognition/data
interim_data - the folder for working interim output of various scripts within the solution
audio_features_data - the set of audio features extracted from each audio file provided as a part of the training set

Note: the files in audio_features_data subfolder have been produced via running Birds Recognition Feature Selection.py

References and Publications

The materials of this repo are used as suppelmentary assets for several publications below

'Pythonic Audio Feature Extraction in the Age of Parallel and Distributed Computing' (https://medium.com/@gvyshnya/pythonic-audio-feature-extraction-in-the-age-of-parallel-and-distributed-computing-793b27641b6d)
'Parallelized Audio Feature Extraction Study' (https://www.kaggle.com/c/birdsong-recognition/discussion/179662)
'Audio Feature Extraction: Best Practices' (https://www.kaggle.com/c/birdsong-recognition/discussion/172573)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bird Call Classification

Files and Folders

Python Scripts

Jupyter Notebooks

Data

References and Publications

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
audio_features_data		audio_features_data
data		data
interim_data		interim_data
Birds Calling EDA and Feature Importance.ipynb		Birds Calling EDA and Feature Importance.ipynb
Birds Recognition Feature Selection.py		Birds Recognition Feature Selection.py
Birds Recognition Feature Selection_single_process.py		Birds Recognition Feature Selection_single_process.py
LICENSE		LICENSE
README.md		README.md
config.py		config.py
dask_mp_feature_extraction.py		dask_mp_feature_extraction.py
lightgbm_training.py		lightgbm_training.py
log_regression_training.py		log_regression_training.py
multiple_models_training.py		multiple_models_training.py
ray_mp_feature_extraction.py		ray_mp_feature_extraction.py
utils.py		utils.py

License

gvyshnya/BirdCallClassification

Folders and files

Latest commit

History

Repository files navigation

Bird Call Classification

Files and Folders

Python Scripts

Jupyter Notebooks

Data

References and Publications

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages