Nexdata-AI/380000-Groups-Japanese-English-Parallel-Corpus-Data

380000-Groups-Japanese-English-Parallel-Corpus-Data

Description

A Japanese-English parallel corpus of 380,000 groups in total. Political, pornographic, personal-information, and other sensitive vocabulary has been excluded. It can serve as a base corpus for text-based data analysis and is suited to machine translation and related fields.

For more details, please refer to the link: https://www.nexdata.ai/datasets/nlu/153?source=Github

Specifications

Storage format

TXT

Data content

Japanese-English Parallel Corpus Data

Data size

380,000 pairs of Japanese-English parallel corpus data

Language

Japanese, English

Application scenario

Machine translation
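
The specifications above state only that the data is stored as TXT; the exact file layout is not documented here. As a rough sketch, assuming UTF-8 text with one pair per line and the Japanese and English sides separated by a tab (both are assumptions, and `read_pairs` is a hypothetical helper, not part of the dataset), the pairs could be read like this:

```python
import io
from typing import Iterator, TextIO, Tuple


def read_pairs(handle: TextIO, sep: str = "\t") -> Iterator[Tuple[str, str]]:
    """Yield (japanese, english) tuples from a line-oriented text stream.

    ASSUMPTION: one pair per line, Japanese first, separated by `sep`.
    The real corpus layout may differ; adjust `sep` and the column order.
    """
    for line in handle:
        line = line.rstrip("\r\n")
        if not line.strip():
            continue  # skip blank lines
        parts = line.split(sep, 1)
        if len(parts) != 2:
            continue  # skip lines that do not contain the separator
        ja, en = parts[0].strip(), parts[1].strip()
        if ja and en:
            yield ja, en


# Small in-memory sample standing in for a corpus file.
sample = io.StringIO("こんにちは\tHello\n\nありがとう\tThank you\nmalformed line\n")
pairs = list(read_pairs(sample))
```

For a file on disk, the same function can be given `open(path, encoding="utf-8")`, where `path` is whatever location the TXT file was saved to. Lines without the separator are skipped silently here; counting them instead may be preferable when checking data quality.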

Licensing Information

Commercial License