Skip to content

A mini project in the course D7041E, Applied Artificial Intelligence, Lp2, H22 at LTU.

License

Notifications You must be signed in to change notification settings

IsakLundstrom/D7041E-Mini-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Emotion and gender speech audio classification using CNN, NN and SVM - D7041E Mini Project

This the self choosen mini project for the course D7041E, Applied Artificial Intelligence, Lp2, H22 at LTU. The project choosen was to be able to classify emotion and gender from a data set of speech audio files, i.e. the RAVDESS Emotional speech audio dataset, using a CNN, a NN and a SVM.

Requirments

Python version 3.10.0 was used during the project, this might not be necessary but is what we used.

To install all required/used python packages type

pip install -r requirments.txt

in any terminal.

Run the python notebook project.ipynb in any supporting program, ex. Jupyer Notebook, VSCode or Google Colab.

Authors

Group 8

Video

The link to the video with the presentation of the project:

The dataset

The dataset used is from Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS). In this project only the speech audio-only files where used, downloaded from here:

After downloading, unzip the file and put the folder audio_speech_actors_01-24 in the same directory as the project.ipynb file.

Results

To see all steps, results and conclusions checkout projects.ipynb.

About

A mini project in the course D7041E, Applied Artificial Intelligence, Lp2, H22 at LTU.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published