Skip to content

Ragnacodes/NLP-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NLP-Project

Natural Processing Language Project - IUST 2022

Overview

Spelling correction is a well-known task in Natural Language Processing (NLP). Automatic spelling correction is essential for many NLP applications like web search engines, text summarization, sentiment analysis, etc.

Directories

Report Files

You can open the markdown file or download the PDF files.

The phase of the project Markdown file PDF file
P1 Open Open

Run the code

Update the dataset

To update the dataset from scratch, you need to follow these steps:

  • Crawl the raw data.

  • Preprocess the data.

  • Add noise to the data.

You can run these steps by a single command.

# First of all, you need to provide your wordlist.
# The default wordlist is `data/wordlist.txt`,
# You can download English dictionary by running ./src/download_wordlist.sh
./run.sh

In the end, it will report some metrics and plot the histogram.

About

Natural Language Processing Project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published