101-Hours-Italian-Children-Spontaneous-Speech-Data

Description

The 101 Hours - Italian Child's Spontaneous Speech Data, manually screened and processed. Annotation contains transcription text, speaker identification, gender and other informantion. This dataset can be applied in speech recognition (acoustic model or language model training), caption generation, voice content moderation and other AI algorithm research.

For more details, please refer to the link: https://www.nexdata.ai/datasets/1300?source=Github

Specifications

Format

16k Hz, 16 bit, wav, mono channel;

Age

12 years old and younger children;

Content category

including self-media, conversation, live, lecture, variety show;

Language

Italian

Annotation

annotation for the transcription text, speaker identification, gender;

Accuracy

Word Accuracy Rate (WAR) at least 98%.

Licensing Information

Commercial License

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
000002_1.txt		000002_1.txt
000002_1.wav		000002_1.wav
000002_10.txt		000002_10.txt
000002_10.wav		000002_10.wav
000002_11.txt		000002_11.txt
000002_11.wav		000002_11.wav
000002_2.txt		000002_2.txt
000002_2.wav		000002_2.wav
000002_3.txt		000002_3.txt
000002_3.wav		000002_3.wav
000002_4.txt		000002_4.txt
000002_4.wav		000002_4.wav
000002_5.txt		000002_5.txt
000002_5.wav		000002_5.wav
000002_6.txt		000002_6.txt
000002_6.wav		000002_6.wav
000002_7.txt		000002_7.txt
000002_7.wav		000002_7.wav
000002_8.txt		000002_8.txt
000002_8.wav		000002_8.wav
000002_9.txt		000002_9.txt
000002_9.wav		000002_9.wav
000003_1.txt		000003_1.txt
000003_1.wav		000003_1.wav
000003_10.txt		000003_10.txt
000003_10.wav		000003_10.wav
000003_11.txt		000003_11.txt
000003_11.wav		000003_11.wav
000003_2.txt		000003_2.txt
000003_2.wav		000003_2.wav
000003_3.txt		000003_3.txt
000003_3.wav		000003_3.wav
000003_4.txt		000003_4.txt
000003_4.wav		000003_4.wav
000003_5.txt		000003_5.txt
000003_5.wav		000003_5.wav
000003_6.txt		000003_6.txt
000003_6.wav		000003_6.wav
000003_7.txt		000003_7.txt
000003_7.wav		000003_7.wav
000003_8.txt		000003_8.txt
000003_8.wav		000003_8.wav
000003_9.txt		000003_9.txt
000003_9.wav		000003_9.wav
000004_1.txt		000004_1.txt
000004_1.wav		000004_1.wav
000004_10.txt		000004_10.txt
000004_10.wav		000004_10.wav
000004_11.txt		000004_11.txt
000004_11.wav		000004_11.wav
000004_2.txt		000004_2.txt
000004_2.wav		000004_2.wav
000004_3.txt		000004_3.txt
000004_3.wav		000004_3.wav
000004_4.txt		000004_4.txt
000004_4.wav		000004_4.wav
000004_5.txt		000004_5.txt
000004_5.wav		000004_5.wav
000004_6.txt		000004_6.txt
000004_6.wav		000004_6.wav
000004_7.txt		000004_7.txt
000004_7.wav		000004_7.wav
000004_8.txt		000004_8.txt
000004_8.wav		000004_8.wav
000004_9.txt		000004_9.txt
000004_9.wav		000004_9.wav
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

101-Hours-Italian-Children-Spontaneous-Speech-Data

Description

Specifications

Format

Age

Content category

Language

Annotation

Accuracy

Licensing Information

About

Releases

Packages

Nexdata-AI/101-Hours-Italian-Children-Spontaneous-Speech-Data

Folders and files

Latest commit

History

Repository files navigation

101-Hours-Italian-Children-Spontaneous-Speech-Data

Description

Specifications

Format

Age

Content category

Language

Annotation

Accuracy

Licensing Information

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages