Install environment

vits2-vietnam

Follow like this:

Install environment

conda create -y -n whisper python=3.8
conda activate whisper
conda install cudatoolkit=11.3.1 cudnn=8.2.1
pip install -r requirements.txt

Process data

Prepare dataset

if download from youtube: yt-dlp -x --audio-format wav --batch-file youtube_links.txt
python Step0_merge_input.py (create each file <= 50k character )
Upload text to vbee.vn => download
python Step1_split_audio.py
python Step2_transcription_whisper.py
python Step2_transcription_wav2vec.py
python Step3_map_file.py

Create label

map predict with ground truth => file map
check file map with predict
If use from raw audio from youtube use postprocess.ipynb and download in folder crawl youtube
pyton Step4_create_label.py

remember cover audio to mono: python check_type_audio.py

Trainning in vits

git clone https://github.com/p0p4k/vits2_pytorch.git 

cd vit2_pytorch

cd monotonic_align

python setup.py build_ext --inplace

cd ..

add line "_letters = '0123456789aáảàãạâấẩầẫậăắẳằẵặbcdđeéẻèẽẹêếểềễệfghiíỉìĩịjklmnoóỏòõọôốổồỗộơớởờỡợpqrstuúủùũụưứửừữựvwxyýỷỳỹỵz'" in text/symbols.py
add line "logging.getLogger('numba').setLevel(logging.WARNING)" in utils.py (ignore warning numba)
copy file vits2_blv_AQ.json into config

python preprocess.py --text_index 1 --filelists filelists/infore_audio_text_train_filelist.txt filelists/infore_audio_text_val_filelist.txt filelists/infore_audio_text_test_filelist.txt --text_cleaners basic_cleaners

python train.py -c config/vits2_blv_AQ -m blv_AQ

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Install environment

Process data

Prepare dataset

Create label

Trainning in vits

Credits

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
crawl youtube		crawl youtube
postprocess		postprocess
.gitignore		.gitignore
README.md		README.md
Step0_merge_input.py		Step0_merge_input.py
Step1_split_audio.py		Step1_split_audio.py
Step2_transcription_wav2vec.py		Step2_transcription_wav2vec.py
Step2_transcription_whisper.py		Step2_transcription_whisper.py
Step3_map_file.py		Step3_map_file.py
Step4_create_label.py		Step4_create_label.py
check_type_audio.py		check_type_audio.py
postprocess.ipynb		postprocess.ipynb
requirements.txt		requirements.txt
slicer2.py		slicer2.py
split_label.py		split_label.py
vits2_blv_AQ.json		vits2_blv_AQ.json
youtube_links.txt		youtube_links.txt

datnt153/vits2-vietnam

Folders and files

Latest commit

History

Repository files navigation

Install environment

Process data

Prepare dataset

Create label

Trainning in vits

Credits

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages