Identifies six classes of emotion from the audio (speech) signal, using Mel spectrograms to capture speech features and a CNN architecture for feature extraction.
The best model is Model2.ipynb - please open the Jupyter notebook to view the architecture and results. The confusion matrix, classification report, and validation results are included in every model notebook.
Model 1 contains transfer learning models: an attempt to examine how spectrograms perform with existing image models.
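For illustration only, here is a small PyTorch CNN classifying Mel-spectrogram inputs into six emotion classes. This is a hypothetical sketch, not the actual architecture in Model2.ipynb (layer counts, filter sizes, and the input shape of 128x94 are all assumptions):

```python
import torch
import torch.nn as nn

class EmotionCNN(nn.Module):
    """Illustrative 6-class CNN over Mel-spectrogram 'images'."""
    def __init__(self, n_classes=6):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.AdaptiveAvgPool2d(1),  # pool to 1x1 so input size can vary
        )
        self.classifier = nn.Linear(32, n_classes)

    def forward(self, x):
        x = self.features(x).flatten(1)
        return self.classifier(x)  # raw logits, one per emotion class

model = EmotionCNN()
# Batch of 4 single-channel spectrograms (128 Mel bins x 94 frames, assumed shape)
logits = model(torch.randn(4, 1, 128, 94))
print(logits.shape)  # torch.Size([4, 6])
```

Training would pair these logits with a cross-entropy loss over the six emotion labels.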
Full report is available on request. All folders with relevant files to run (without filtering) are available at https://drive.google.com/drive/folders/1IildW2vjEOvcHgVBWTHVwkGyXLlCpY6v?usp=sharing
Repo owners: see the contributors list.
Link to Kaggle dataset: https://www.kaggle.com/ejlok1/cremad
Link to Github for Demographics file: https://github.com/CheyneyComputerScience/CREMA-D