wav2letter--

Repository for Korean speech-to-text.

Results

Note that current version does not include any LM (Language Model).
Values of WER (Word Error Rate) on the graph is actually a CER (Character Error Rate).
Took almost a month to train this model.

	Texts
Answer label	데 뭐 쾌락이라는 게 기분이 좋은 거잖아. 데 기분이 좋다고 해서 모두가 다 나쁜 걸까, 라는 생각을 해.
Output	근데 뭐 켸라ᅡᆨ이라는 게 기분이 좋은 거잖아ᅡ. 그 ᄀ기분이 좋다ᄀ고 해서 모두가 ᄃ다다ᅡ쁜 거ᅥᆯ까 라는 새ᅢᆼ각을 해.
CTC Decoded	근데 뭐 켸락이라는 게 기분이 좋은 거잖아. 그 기분이 좋다고 해서 모두가 다다쁜 걸까 라는 생각을 해.

Name	url
KSS	https://www.kaggle.com/bryanpark/korean-single-speaker-speech-dataset
Clova Call	https://github.com/clovaai/ClovaCall
Zeroth	https://github.com/goodatlas/zeroth
KsponSpeech	https://aihub.or.kr/aidata/105/download
ProSem	https://github.com/warnikchow/prosem
Pansori	https://github.com/yc9701/pansori-tedxkr-corpus
Acryl	http://aicompanion.or.kr/kor/tech/data.php

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
figs		figs
models		models
texts		texts
waveglow @ 8afb643		waveglow @ 8afb643
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
config.json		config.json
distributed.py		distributed.py
inference.py		inference.py
mel2samp_waveglow.py		mel2samp_waveglow.py
prepare_batch.py		prepare_batch.py
run.py		run.py