Emotion and gender speech audio classification using CNN, NN and SVM - D7041E Mini Project

This the self choosen mini project for the course D7041E, Applied Artificial Intelligence, Lp2, H22 at LTU. The project choosen was to be able to classify emotion and gender from a data set of speech audio files, i.e. the RAVDESS Emotional speech audio dataset, using a CNN, a NN and a SVM.

Requirments

Python version 3.10.0 was used during the project, this might not be necessary but is what we used.

To install all required/used python packages type

pip install -r requirments.txt

in any terminal.

Run the python notebook project.ipynb in any supporting program, ex. Jupyer Notebook, VSCode or Google Colab.

Authors

Group 8

Isak Lundmark - lunisa-9@student.ltu.se
Isak Lundström - isalun-9@student.ltu.se
Ludvig Hedlund - ludhed-8@student.ltu.se

Video

The link to the video with the presentation of the project:

https://www.youtube.com/watch?v=kF6cNwgFS3Y

The dataset

The dataset used is from Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS). In this project only the speech audio-only files where used, downloaded from here:

https://www.kaggle.com/datasets/uwrfkaggler/ravdess-emotional-speech-audio

After downloading, unzip the file and put the folder audio_speech_actors_01-24 in the same directory as the project.ipynb file.

Results

To see all steps, results and conclusions checkout projects.ipynb.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
project.ipynb		project.ipynb
requirments.txt		requirments.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Emotion and gender speech audio classification using CNN, NN and SVM - D7041E Mini Project

Requirments

Authors

Video

The dataset

Results

About

Releases

Packages

Contributors 3

Languages

License

IsakLundstrom/D7041E-Mini-Project

Folders and files

Latest commit

History

Repository files navigation

Emotion and gender speech audio classification using CNN, NN and SVM - D7041E Mini Project

Requirments

Authors

Video

The dataset

Results

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages