Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training & languages #4

Open
Mlallena opened this issue Jul 8, 2021 · 12 comments
Open

Training & languages #4

Mlallena opened this issue Jul 8, 2021 · 12 comments

Comments

@Mlallena
Copy link

Mlallena commented Jul 8, 2021

If I wanted to add new languages to this program, or train the ones already present, how would I have to do it?

Also, you should update the link at the first instruction - I had to replace "latest" for "1.0.1" so I could download it.

@igorsitdikov
Copy link
Owner

igorsitdikov commented Jul 8, 2021

Thank you. Unfortunately you have to train your own model for new language.
Or you can try https://huggingface.co/TalTechNLP/voxlingua107-epaca-tdnn

@Mlallena
Copy link
Author

Mlallena commented Jul 9, 2021

What did you use to train your own model? I'm asking because (unless I missed something) this repository doesn't have any code that is clearly used for training.

@igorsitdikov
Copy link
Owner

have a look #1

@Mlallena
Copy link
Author

Mlallena commented Jul 9, 2021

Thanks, I'll have a look.

@Mlallena
Copy link
Author

OK, I have been checking, and it could work. Thing is, from what you said in #1, the only modification you make would be to the utt2spk file, but where would this file be stored? I'm going to go out on a limb and say that it is stored in a data folder within v2, but the main problem is that the run.sh file doesn't refer to that file. I'd also have to modify which corpus it is trying to target, since the audios are in a different folder.

Any help you can give me would be welcome.

@Mlallena Mlallena changed the title Training & languges Training & languages Jul 16, 2021
@asadullah797
Copy link

asadullah797 commented Mar 27, 2022

Hi Igor,
I am training Kaldi recipe on voxlingua data for language identification task but I could not find trials file.
Can you please share with me the trials file.
Many thanks.

@igorsitdikov
Copy link
Owner

igorsitdikov commented Mar 28, 2022

Hello @asadullah797. You can generate file on your own.
It will look something like this:

lang-id-A utt-id-A target
lang-id-A utt-id-B nontarget
lang-id-A utt-id-C nontarget
lang-id-B utt-id-A nontarget
lang-id-B utt-id-B target

for 3 files and 3 languages:

en utt-en target
en utt-ru nontarget
en utt-pl nontarget
ru utt-en nontarget
ru utt-ru target
ru utt-pl nontarget
pl utt-en nontarget
pl utt-ru nontarget
pl utt-pl target

Sorry I don't remember, probably columns 1 and 2 should be swapped

@asadullah797
Copy link

For lang id task; how can you define

lang-id-A utt-id-B nontarget

I mean how can you decide whether the given utterance is target/non-target.
Thanks

@igorsitdikov
Copy link
Owner

you have dataset with 3 languages, each wav file has only one language, you should have map wav file - language, so it will be target. all other 3 languages will be nontarget for the file.

@asadullah797
Copy link

Just to confirm;
(wav1:>en, wav2:>es, wav3:>de)
en wav1 target
es wav1 nontarget
de wav1 nontarget
and so on for other cases as well.

@igorsitdikov
Copy link
Owner

I think so. But as I wrote before, if it will not work, try to swap columns 1 and 2
like this. Sorry really don't remember.
wav1 en target
wav1 es nontarget
wav1 de nontarget

@asadullah73-ce
Copy link

Hi Igor;
I have prepared trials file using (https://github.com/kaldi-asr/kaldi/blob/master/egs/aishell/v1/local/produce_trials.py) but at the end of the script I am getting this kind of error:
Key de__071xs-uBRZo__U__S10---0150.960-0167.120 not present in training iVectors
The key is the utterance_id in above.
Please note that I have created trials file from test data utt2spk.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants