Skip to content

Nexdata-AI/1080000-Groups-English-Russian-Parallel-Corpus-Data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 

Repository files navigation

1080000-Groups-English-Russian-Parallel-Corpus-Data

Description

English and Russian parallel corpus, 1,080,000 groups in total; excluded political, porn, personal information and other sensitive vocabulary; it can be a base corpus for text-based data analysis, used in machine translation and other fields.

For more details, please refer to the link: https://www.nexdata.ai/datasets/nlu/1161?source=Github

Specifications

Storage format

TXT

Data content

English-Russian Parallel Corpus Data

Data size

1.08 million pairs of English-Russian Parallel Corpus Data

Language

English,Russian

Application scenario

machine translation

Licensing Information

Commercial License