A dataset of informal Persian audio and text chunks, along with a fully open processing pipeline, suitable for ASR and TTS tasks. Created from crawled content on virgool.io.
tts persian speech-processing asr forced-alignment speech-dataset persian-speech-recognition asr-evaluation persian-speech-dataset persian-text-to-speech speech-data-collection persian-speech-corpus
-
Updated
Sep 13, 2024 - Jupyter Notebook