Skip to content

Commit

Permalink
build files
Browse files Browse the repository at this point in the history
  • Loading branch information
bagustris committed Jun 4, 2024
1 parent 7948337 commit 6d2199b
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,7 @@ The table can be browsed, sorted and searched under https://superkogito.github.i
| <sub>[Quechua-SER](https://figshare.com/articles/media/Quechua_Collao_for_Speech_Emotion_Recognition/20292516)</sub> | <sub>2022</sub> | <sub>12420 audio recordings (~15 hours) and their transcriptions by 7 native speakers.</sub> | <sub>Emotional labels using dimensions: valence, arousal, and dominance.</sub> | <sub>Audio</sub> | <sub>3.53 GB</sub> | <sub>Quechua Collao</sub> | <sub>[A speech corpus of Quechua Collao for automatic dimensional emotion recognition](https://www.nature.com/articles/s41597-022-01855-9)</sub> | <sub>Open</sub> | <sub>[CC BY 4.0](https://creativecommons.org/licenses/by/4.0/)</sub> |
| <sub>[MESD](https://data.mendeley.com/datasets/cy34mh68j9/5)</sub> | <sub>2022</sub> | <sub>864 audio files of single-word emotional utterances with Mexican cultural shaping.</sub> | <sub>6 emotions provides single-word utterances for anger, disgust, fear, happiness, neutral, and sadness.</sub> | <sub>Audio</sub> | <sub>0,097 GB</sub> | <sub>Spanish (Mexican)</sub> | <sub>[The Mexican Emotional Speech Database (MESD): elaboration and assessment based on machine learning](https://pubmed.ncbi.nlm.nih.gov/34891601/)</sub> | <sub>Open</sub> | <sub>[CC BY 4.0](https://creativecommons.org/licenses/by/4.0/)</sub> |
| <sub>[SyntAct](https://zenodo.org/record/6573016#.ZAjy_9LMJpj)</sub> | <sub>2022</sub> | <sub>Synthesized database of three basic emotions and neutral expression based on rule-based manipulation for a diphone synthesizer which we release to the public </sub> | <sub>997 utterances including 6 emotions: angry, bored, happy, neutral, sad and scared</sub> | <sub>Audio</sub> | <sub>941 MB</sub> | <sub>German</sub> | <sub>[SyntAct: A Synthesized Database of Basic Emotions](http://felix.syntheticspeech.de/publications/synthetic_database.pdf)</sub> | <sub>Open</sub> | <sub>[CC BY-SA 4.0](https://creativecommons.org/licenses/by/4.0)</sub> |
| <sub>[LSSED](https://github.com/tobefans/LSSED)</sub> | <sub>2021</sub> | <sub>Large Scale Spanish Emotional Speech Database</sub> | <sub>8 emotions provides Spanish spoken utterances for anger, boredom, disgust, fear, happiness, neutral, sadness, and surprise.</sub> | <sub>Audio</sub> | <sub>90 GB</sub> | <sub>Spanish (Castilian)</sub> | <sub>[LSSED: A Large-Scale Spanish Emotional Speech Database for Speech Processing and Machine Learning](https://www.mdpi.com/1424-8220/21/23/6985)</sub> | <sub>Restricted</sub> | <sub>[-](https://creativecommons.org/licenses/by-sa/4.0/)</sub> |
| <sub>[LSSED](https://github.com/tobefans/LSSED)</sub> | <sub>2021</sub> | <sub>LSSED: A Large-Scale Dataset and Benchmark for Speech Emotion Recognition</sub> | <sub>Anger, happiness, sadness, disappointment, boredom, disgust, excitement, fear, surprise, normal, and other.</sub> | <sub>Audio</sub> | <sub>90 GB</sub> | <sub>English</sub> | <sub>[LSSED: A Large-Scale Spanish Emotional Speech Database for Speech Processing and Machine Learning](https://arxiv.org/abs/2102.01754)</sub> | <sub>Restricted</sub> | <sub>[-](https://github.com/tobefans/LSSED/blob/main/EULA.pdf)</sub> |
| <sub>[MLEnd](https://www.kaggle.com/datasets/jesusrequena/mlend-spoken-numerals)</sub> | <sub>2021</sub> | <sub>~32700 audio recordings files produced by 154 speakers. Each audio recording corresponds to one English numeral (from "zero" to "billion")</sub> | <sub>Intonations: neutral, bored, excited and question</sub> | <sub>Audio</sub> | <sub>2.27 GB</sub> | <sub>--</sub> | <sub>--</sub> | <sub>Open</sub> | <sub>Unknown</sub> |
| <sub>[ASVP-ESD](https://www.kaggle.com/datasets/dejolilandry/asvpesdspeech-nonspeech-emotional-utterances)</sub> | <sub>2021</sub> | <sub>~13285 audio files collected from movies, tv shows and youtube containing speech and non-speech.</sub> | <sub>12 different natural emotions (boredom, neutral, happiness, sadness, anger, fear, surprise, disgust, excitement, pleasure, pain, disappointment) with 2 levels of intensity.</sub> | <sub>Audio</sub> | <sub>2 GB</sub> | <sub>Chinese, English, French, Russian and others</sub> | <sub>--</sub> | <sub>Open</sub> | <sub>Unknown</sub> |
| <sub>[ESD](https://hltsingapore.github.io/ESD/)</sub> | <sub>2021</sub> | <sub>29 hours, 3500 sentences, by 10 native English speakers and 10 native Chinese speakers.</sub> | <sub>5 emotions: angry, happy, neutral, sad, and surprise.</sub> | <sub>Audio, Text</sub> | <sub>2.4 GB (zip)</sub> | <sub>Chinese, English</sub> | <sub>[Seen And Unseen Emotional Style Transfer For Voice Conversion With A New Emotional Speech Dataset](https://arxiv.org/pdf/2010.14794.pdf)</sub> | <sub>Open</sub> | <sub>Academic License</sub> |
Expand Down
2 changes: 1 addition & 1 deletion src/ser-datasets.csv
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,7 @@ Dataset,Year,Content,Emotions,Format,Size,Language,Paper,Access,License
`Quechua-SER <https://figshare.com/articles/media/Quechua_Collao_for_Speech_Emotion_Recognition/20292516>`_,2022,12420 audio recordings (~15 hours) and their transcriptions by 7 native speakers.,"Emotional labels using dimensions: valence, arousal, and dominance.",Audio,3.53 GB,Quechua Collao,`A speech corpus of Quechua Collao for automatic dimensional emotion recognition <https://www.nature.com/articles/s41597-022-01855-9>`_,Open,`CC BY 4.0 <https://creativecommons.org/licenses/by/4.0/>`_
`MESD <https://data.mendeley.com/datasets/cy34mh68j9/5>`_,2022,864 audio files of single-word emotional utterances with Mexican cultural shaping.,"6 emotions provides single-word utterances for anger, disgust, fear, happiness, neutral, and sadness.",Audio,"0,097 GB",Spanish (Mexican),`The Mexican Emotional Speech Database (MESD): elaboration and assessment based on machine learning <https://pubmed.ncbi.nlm.nih.gov/34891601/>`_,Open,`CC BY 4.0 <https://creativecommons.org/licenses/by/4.0/>`_
`SyntAct <https://zenodo.org/record/6573016#.ZAjy_9LMJpj>`_,2022,Synthesized database of three basic emotions and neutral expression based on rule-based manipulation for a diphone synthesizer which we release to the public ,"997 utterances including 6 emotions: angry, bored, happy, neutral, sad and scared",Audio,941 MB,German,`SyntAct: A Synthesized Database of Basic Emotions <http://felix.syntheticspeech.de/publications/synthetic_database.pdf>`_,Open,`CC BY-SA 4.0 <https://creativecommons.org/licenses/by/4.0>`_
`LSSED <https://github.com/tobefans/LSSED>`_,2021,Large Scale Spanish Emotional Speech Database,"8 emotions provides Spanish spoken utterances for anger, boredom, disgust, fear, happiness, neutral, sadness, and surprise.",Audio,90 GB,Spanish (Castilian),`LSSED: A Large-Scale Spanish Emotional Speech Database for Speech Processing and Machine Learning <https://www.mdpi.com/1424-8220/21/23/6985>`_,Restricted,`- <https://creativecommons.org/licenses/by-sa/4.0/>`_
`LSSED <https://github.com/tobefans/LSSED>`_,2021,LSSED: A Large-Scale Dataset and Benchmark for Speech Emotion Recognition,"Anger, happiness, sadness, disappointment, boredom, disgust, excitement, fear, surprise, normal, and other.",Audio,90 GB,English,`LSSED: A Large-Scale Spanish Emotional Speech Database for Speech Processing and Machine Learning <https://arxiv.org/abs/2102.01754>`_,Restricted,`- <https://github.com/tobefans/LSSED/blob/main/EULA.pdf>`_
`MLEnd <https://www.kaggle.com/datasets/jesusrequena/mlend-spoken-numerals>`_,2021,"~32700 audio recordings files produced by 154 speakers. Each audio recording corresponds to one English numeral (from ""zero"" to ""billion"")","Intonations: neutral, bored, excited and question",Audio,2.27 GB,--,--,Open,Unknown
`ASVP-ESD <https://www.kaggle.com/datasets/dejolilandry/asvpesdspeech-nonspeech-emotional-utterances>`_,2021,"~13285 audio files collected from movies, tv shows and youtube containing speech and non-speech.","12 different natural emotions (boredom, neutral, happiness, sadness, anger, fear, surprise, disgust, excitement, pleasure, pain, disappointment) with 2 levels of intensity.",Audio,2 GB,"Chinese, English, French, Russian and others",--,Open,Unknown
`ESD <https://hltsingapore.github.io/ESD/>`_,2021,"29 hours, 3500 sentences, by 10 native English speakers and 10 native Chinese speakers.","5 emotions: angry, happy, neutral, sad, and surprise.","Audio, Text",2.4 GB (zip),"Chinese, English",`Seen And Unseen Emotional Style Transfer For Voice Conversion With A New Emotional Speech Dataset <https://arxiv.org/pdf/2010.14794.pdf>`_,Open,Academic License
Expand Down

0 comments on commit 6d2199b

Please sign in to comment.