Uighur language and its parallel corresponding Chinese text data, 38,000 groups in total. They been cleaned, desensitized and gone through quality check. It can be used as base corpus for text data analysis in machine translation and related fields.
For more details, please refer to the link: https://www.nexdata.ai/datasets/nlu/194?source=Github
TXT
Uighur-Chinese Parallel Corpus Data
0.38 million pairs of Uighur-Chinese Parallel Corpus Data
Uighur, Chinese
machine translation
Commercial License