speech-data-collection

Here are 3 public repositories matching this topic...

MahtaFetrat / ManaTTS-Persian-Speech-Dataset

ManaTTS is the largest open Persian speech dataset with 86+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.

text-to-speech tts speech-synthesis persian data-collection data-preprocessing speech-processing forced-alignment speech-dataset speech-corpus dataset-preparation persian-speech tts-dataset text-to-speech-dataset mana-tts speech-data-collection

Updated Sep 13, 2024
Jupyter Notebook

MahtaFetrat / GPTInformal-Persian-Speech-Dataset

Star

A free licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject

Updated Sep 22, 2024

MahtaFetrat / VirgoolInformal-Speech-Dataset

Star

A dataset of informal Persian audio and text chunks, along with a fully open processing pipeline, suitable for ASR and TTS tasks. Created from crawled content on virgool.io.

tts persian speech-processing asr forced-alignment speech-dataset persian-speech-recognition asr-evaluation persian-speech-dataset persian-text-to-speech speech-data-collection persian-speech-corpus

Updated Sep 13, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the speech-data-collection topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-data-collection topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly