Does non-English TTS training work properly now? #4606

aidosRepoint · 2022-07-25T09:03:37Z

Hi all!

I've come across this issue from 2020, which clearly shows that non-English TTS had some issues at that time. Has the state of things changed since then?

I need to train Tacotron2 for Russian, but I'm running into a lot of trouble at the preprocessing stage, mainly with text preprocessing. Unfortunately, I couldn't find any decent tutorials on how to train a model on a non-English dataset.

Can anyone help?

redoctopus · 2022-07-25T21:11:39Z

Collaborator

Our current tokenizers cover English and German characters, so you may need to add a new tokenizer class (and possibly a G2P class) to train on Russian. @aroraakshit wrote this tutorial for adding German support to TTS and may be able to answer other questions you have about adding support for additional languages.
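To illustrate what "adding a new tokenizer" involves at its simplest, here is a minimal stand-alone sketch of a character-level tokenizer for Russian. It is not NeMo's tokenizer API: the class name `RussianCharTokenizer`, the punctuation set, and the choice of ID 0 for padding are all assumptions made for illustration. A real integration would follow the German tutorial and the existing tokenizer classes.

```python
# Hypothetical stand-alone sketch; it does not use or mirror NeMo's tokenizer classes.
RUSSIAN_LETTERS = "абвгдеёжзийклмнопрстуфхцчшщъыьэюя"  # 33 lowercase letters
PUNCTUATION = ",.!?-:;"  # assumed punctuation set, adjust to the dataset


class RussianCharTokenizer:
    """Maps lowercase Russian characters, space, and punctuation to integer IDs."""

    def __init__(self, letters=RUSSIAN_LETTERS, punctuation=PUNCTUATION):
        # ID 0 is reserved for padding; space comes next, then letters and punctuation.
        self.pad = "<pad>"
        self.symbols = [self.pad, " "] + list(letters) + list(punctuation)
        self.symbol_to_id = {s: i for i, s in enumerate(self.symbols)}

    def encode(self, text):
        """Lowercase the text and silently drop characters outside the symbol set."""
        return [self.symbol_to_id[ch] for ch in text.lower() if ch in self.symbol_to_id]

    def decode(self, ids):
        """Invert encode(); padding IDs are skipped."""
        return "".join(self.symbols[i] for i in ids if i != 0)


tokenizer = RussianCharTokenizer()
ids = tokenizer.encode("Привет, мир!")
print(ids)
print(tokenizer.decode(ids))
```

Dropping unknown characters silently keeps the sketch short; for real training data it is usually safer to log them, since they often point to text normalization problems.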

2 replies

NoteToSelfFindGoodNickname Sep 4, 2022

Can you please edit the link? It currently points to 404.

redoctopus Sep 6, 2022
Collaborator

Edited, it should work now.

laurentbenaroya · 2024-07-22T14:26:33Z

Hello,
I am trying to train a French TTS model based on FastPitch. I have a speech database with 120 speakers and more than 9000 utterances. Are there any guidelines or a tutorial for supporting a new language? I am fairly new to this.

1 reply

redoctopus Jul 22, 2024
Collaborator

The link above to the tutorial for adding German should still work: https://github.com/NVIDIA/NeMo/blob/main/tutorials/tts/FastPitch_GermanTTS_Training.ipynb

Please follow the same steps as in the notebook for your data.

laurentbenaroya · 2024-07-22T23:19:27Z

Thanks, Jocelyn, for your quick answer.
There is still a problem with the tokenizer. The tutorial uses a German tokenizer, referenced in the file https://raw.githubusercontent.com/NVIDIA/NeMo/main/examples/tts/conf/de/fastpitch_align_22050_grapheme.yaml, which the notebook downloads and which contains the following lines:

    text_tokenizer:
      _target_: nemo.collections.common.tokenizers.text_to_speech.tts_tokenizers.GermanCharsTokenizer
      punct: true
      apostrophe: true
      pad_with_space: true

But there is no equivalent for French, and I am not experienced enough to write such a tokenizer myself.

2 replies

redoctopus Jul 22, 2024
Collaborator

Unfortunately, we can't really write a truly generic tokenizer, since the punctuation symbols and valid characters differ a lot between languages. You could try using the default English one (or the Spanish or German one, whichever is closest to what you need), but that's not ideal and will probably cause some issues.

@XuesongYang and @rlangman for any additional thoughts/ideas?
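One way to see how far an existing tokenizer's character set falls short for a new language is to count the transcript characters it does not cover. The sketch below is a stand-alone illustration: the function name `uncovered_characters`, the `english_like` alphabet, and the two sample sentences are hypothetical, and the alphabet is not taken from any NeMo tokenizer.

```python
import unicodedata
from collections import Counter


def uncovered_characters(transcripts, alphabet):
    """Count characters in the transcripts that the given alphabet does not contain."""
    allowed = set(alphabet)
    counts = Counter()
    for line in transcripts:
        # Normalize to NFC so that accented letters are single code points.
        normalized = unicodedata.normalize("NFC", line).lower()
        counts.update(ch for ch in normalized if ch not in allowed)
    return counts


# Hypothetical alphabet: plain a-z plus space and basic punctuation.
english_like = "abcdefghijklmnopqrstuvwxyz ,.!?'-"
french_lines = ["Ça coûte très cher.", "Où est l'hôtel ?"]

missing = uncovered_characters(french_lines, english_like)
print(sorted(missing))
```

Running this over a full dataset gives a quick list of the characters that a borrowed tokenizer would drop or mishandle.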

XuesongYang Jul 22, 2024
Collaborator

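We do support French. The supported locales are ["en-US", "de-DE", "es-ES", "it-IT", "fr-FR"]. Try FrenchCharsTokenizer: https://github.com/NVIDIA/NeMo/blob/main/nemo/collections/common/tokenizers/text_to_speech/tts_tokenizers.py#L272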
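As a rough sketch of how the tokenizer section of the German config could be pointed at `FrenchCharsTokenizer`, the snippet below rewrites the class path in a plain Python dict. It assumes the config key is Hydra's `_target_`, that `FrenchCharsTokenizer` lives in the same module as the German class, and that it accepts the same `punct`, `apostrophe`, and `pad_with_space` arguments; check these against `tts_tokenizers.py` before relying on it. The helper `retarget_tokenizer` is hypothetical.

```python
import copy

# Tokenizer section as quoted from the German config earlier in this thread.
german_tokenizer_cfg = {
    "_target_": "nemo.collections.common.tokenizers.text_to_speech."
                "tts_tokenizers.GermanCharsTokenizer",
    "punct": True,
    "apostrophe": True,
    "pad_with_space": True,
}


def retarget_tokenizer(cfg, class_name):
    """Return a copy of cfg whose _target_ names class_name in the same module."""
    new_cfg = copy.deepcopy(cfg)
    module_path, _, _ = new_cfg["_target_"].rpartition(".")
    new_cfg["_target_"] = f"{module_path}.{class_name}"
    return new_cfg


# Assumption: the French class takes the same keyword arguments as the German one.
french_tokenizer_cfg = retarget_tokenizer(german_tokenizer_cfg, "FrenchCharsTokenizer")
print(french_tokenizer_cfg["_target_"])
```

The same change can be made by editing the one `_target_` line in the downloaded YAML file by hand.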

laurentbenaroya · 2024-07-24T08:40:14Z

Thank you for your answer. I'm currently putting together a notebook for French, following the German one as I go. I think I also need a pronunciation dictionary and a heteronyms file, is that right? The dictionary could be built from all the words in my dataset. However, I haven't found any such resources for French online.


laurentbenaroya · 2024-07-24T10:45:42Z

I found the 21 French heteronyms at https://en.wiktionary.org/wiki/Category:French_heteronyms and a dictionary at https://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/French/, in case this helps someone else.
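For checking such a dictionary against the words in a dataset, a minimal sketch follows. It assumes a CMU-style text format, one `word phone phone ...` entry per line with alternative pronunciations marked as `word(2)`; verify this against the actual file. The function names, the sample entries, and the phone symbols are made up for illustration.

```python
import re


def load_pronunciations(lines):
    """Parse 'word phone phone ...' lines; 'word(2)' variants map to the same word."""
    lexicon = {}
    for line in lines:
        parts = line.split()
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        word = re.sub(r"\(\d+\)$", "", parts[0]).lower()
        lexicon.setdefault(word, []).append(parts[1:])
    return lexicon


def out_of_vocabulary(transcripts, lexicon):
    """Return the sorted dataset words that have no entry in the lexicon."""
    words = set()
    for text in transcripts:
        # Simplification: a word is any run of Unicode letters.
        words.update(re.findall(r"[^\W\d_]+", text.lower()))
    return sorted(w for w in words if w not in lexicon)


# Made-up entries and phone symbols, for illustration only.
dict_lines = ["bonjour b on j ou r", "fils f i s", "fils(2) f i l", "monde m on d"]
transcripts = ["Bonjour le monde", "Mes fils"]

lexicon = load_pronunciations(dict_lines)
print(out_of_vocabulary(transcripts, lexicon))
print(len(lexicon["fils"]))
```

Words with more than one pronunciation in the lexicon, such as "fils" here, are candidates for the heteronyms file, and the out-of-vocabulary list shows which dataset words still need entries.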

