749-Hours-UAE-Arabic-Spontaneous-Speech-Data

Description

The 749 hour UAE Arabic Spontaneous Speech Data, the content covering multiple topics. All the speech audio was manually transcribed into text content; speaker identity, gender, and other attribution are also annotated. This dataset can be used for voiceprint recognition model training, corpus construction for machine translation, and algorithm research introduction

For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/1180?source=Github

Specifications

Format

16kHz, 16bit, mono channel;

Content category

Interview; Speech; Variety, etc.

Language

UAE Arabic;

Annotation

annotation for the transcription text, speaker identification, gender;

Application scenarios

speech recognition, video caption generation and video content review;

Accuracy

at a Sentence Accuracy Rate (SAR) of being no less than 95%.

Licensing Information

Commercial License

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
000001_1-1.txt		000001_1-1.txt
000001_1-1.wav		000001_1-1.wav
000001_1-10.txt		000001_1-10.txt
000001_1-10.wav		000001_1-10.wav
000001_1-2.txt		000001_1-2.txt
000001_1-2.wav		000001_1-2.wav
000001_1-3.txt		000001_1-3.txt
000001_1-3.wav		000001_1-3.wav
000001_1-4.txt		000001_1-4.txt
000001_1-4.wav		000001_1-4.wav
000001_1-5.txt		000001_1-5.txt
000001_1-5.wav		000001_1-5.wav
000001_1-6.txt		000001_1-6.txt
000001_1-6.wav		000001_1-6.wav
000001_1-7.txt		000001_1-7.txt
000001_1-7.wav		000001_1-7.wav
000001_1-8.txt		000001_1-8.txt
000001_1-8.wav		000001_1-8.wav
000001_1-9.txt		000001_1-9.txt
000001_1-9.wav		000001_1-9.wav
000001_1.metadata		000001_1.metadata
000001_1.txt		000001_1.txt
000001_1.wav		000001_1.wav
000198_3.metadata		000198_3.metadata
000198_3.txt		000198_3.txt
000198_3.wav		000198_3.wav
000674_1.metadata		000674_1.metadata
000674_1.txt		000674_1.txt
000674_1.wav		000674_1.wav
000912_1.metadata		000912_1.metadata
000912_1.txt		000912_1.txt
000912_1.wav		000912_1.wav
010531_1.metadata		010531_1.metadata
010531_1.txt		010531_1.txt
010531_1.wav		010531_1.wav
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

749-Hours-UAE-Arabic-Spontaneous-Speech-Data

Description

Specifications

Format

Content category

Language

Annotation

Application scenarios

Accuracy

Licensing Information

About

Releases

Packages

Nexdata-AI/749-Hours-UAE-Arabic-Spontaneous-Speech-Data

Folders and files

Latest commit

History

Repository files navigation

749-Hours-UAE-Arabic-Spontaneous-Speech-Data

Description

Specifications

Format

Content category

Language

Annotation

Application scenarios

Accuracy

Licensing Information

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages