Skip to content

hdsceokevin/NKNLP

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NKNLP

This repo deals with NLP for North Korean.

Web crawling for North Korean dictionary

North Korean Dictionary.R is about collecting North Korean Words from the Standard Korean Dictionary of National Institute of Korean Language. NKwords.RDS, the first output of the web crawler contains text data from the website, and I did text data preparation for offering core parts of it. So, If you want to get directly North Korean Dictionary, just download NKdictioary.RDS and run readRDS() function on R or Rstudio.

To do list

1. Web crawling for North Korean dictionary.
2. Web carwling for North Korean articles from a couple of websites, such as newspapers or communities.
3. Text mining with articles using dictionary. (NLP4kec, RcppMeCab packages)

About

This repo deals with NLP for North Korean.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages