Releases: vistec-AI/thai2nmt
Releases · vistec-AI/thai2nmt
scb-mt-en-th-2020 - v1.0
This dataset can be used to reproduce our experiments.
scb-mt-en-th-2020+mt-opus_v1.0
This dataset can be used to reproduce our experiments.
_backtranslated
files contain backtranslated zh
sentences from en
(only those with less than 1000 characters) via Google Cloud Translate API on 2021-06-15. Backtranslation sponsored by pnphannisa.
mt-opus
MT OPUS cleaned (max segment length: 500)
(to reproduce our experiments)