Skip to content

Nexdata-AI/1340000-Groups-English-Korean-Parallel-Corpus-Data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

1340000-Groups-English-Korean-Parallel-Corpus-Data

Description

English and Korean parallel corpus, 1340,000 groups in total; excluded political, porn, personal information and other sensitive vocabulary; it can be a base corpus for text-based data analysis, used in machine translation and other fields.

For more details, please refer to the link: https://www.nexdata.ai/datasets/nlu/154?source=Github

Specifications

Storage format

TXT

Data content

English-Korean Parallel Corpus Data

Data size

1.34 million pairs of English-Korean Parallel Corpus Data

Language

English,Korean

Application scenario

machine translation

Accuracy rate

90%

Licensing Information

Commercial License