Skip to content
JCharis Jesse edited this page Mar 23, 2020 · 1 revision

Welcome to NeatText

NeatText (neattext) is a simple python NLP package for cleaning textual data and for processing text when performing NLP and ML projects.It was designed to solve the following problem

Problem Neattext is intended to solve

  • Cleaning of unstructured text data
  • Reduce noise [special characters,stopwords]
  • Reducing repetition of using the same code for text preprocessing

The NeatText project is maintained by @jcharis but contributors are gladly welcomed.

Features

  • Removing of Noise In Text
    • special characters
    • emails
    • numbers/phone numbers
    • emojis
  • Dealing with stopwords
  • Extracting of emails,numbers,emoji,etc
  • Textmetrics : word statistics
  • Normalizing text

Getting Started

Installation

  • using pip
pip install neattext
Clone this wiki locally