GitHub - hrahmansha/TTS_Bn: CNN based bangla text-to-speech model with Attention mechanism.

This bangla text to speech model is a CNN based architecture with Attention mechanism.

Methodology based on : Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention

Implementation based on: pytorch-dc-tts

Bengali Text to Speech Dataset: Bangla tts dataset by google contains approximately 3100 bangla sentences. This dataset was collected from native Indian Bengali and Bangladesh Bengali speakers.

Result

As there was hardware limitation, the training for the coarse mel spectrogram to the full STFT spectrogram was done only for 60 iterations. The audio samples and pretrained models can be found here link

About The Model Architecture

This TTS model consists of two networks: (1) Text2Mel, which synthesize a mel spectrogram from an input text, and (2) Spectrogram Super-resolution Network (SSRN), which convert a coarse mel spectrogram to the full STFT(Short-time Fourier transform) spectrogram. Figure below shows the overall architecture of the model. For more read this

Training Process

Download the dataset into /datasets folder
Preprocess the dataset.
Train the Text2Mel model
Train the SSRN model`
Test the model

Colab Notebook : (https://colab.research.google.com/drive/1AjsxzBu6ekcv0GF3dyWubj04hhwwHkjE?usp=sharing) This colab playground might seems to be a total mess.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
datasets		datasets
models		models
LICENSE		LICENSE
README.md		README.md
_gitignore		_gitignore
audio.py		audio.py
hparams.py		hparams.py
logger.py		logger.py
model.png		model.png
requirements.txt		requirements.txt
train-ssrn.py		train-ssrn.py
train-text2mel.py		train-text2mel.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Result

About The Model Architecture

Training Process

About

Releases

Packages

Languages

License

hrahmansha/TTS_Bn

Folders and files

Latest commit

History

Repository files navigation

Result

About The Model Architecture

Training Process

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages