Skip to content

Nexdata-AI/750000-Groups-Chinese-Burmese-Parallel-Corpus-Data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

750000-Groups-Chinese-Burmese-Parallel-Corpus-Data

Description

0.75 Million Pairs of Sentences - Chinese-Burmese Parallel Corpus Data be stored in text format. It covers multiple fields such as tourism, medical treatment, daily life, news, etc. The data desensitization and quality checking had been done. It can be used as a basic corpus for text data analysis in fields such as machine translation.

For more details, please refer to the link: https://www.nexdata.ai/datasets/nlu/1184?source=Github

Storage format

TXT

Data content

Chinese-Burmese Parallel Corpus Data

Data size

0.75 million pairs of Chinese-Burmese Parallel Corpus Data. The Chinese sentences contain 18 characters on average.

Language

Chinese, Burmese

Application scenario

machine translation

Licensing Information

Commercial License