Python library for processing Japanese text.

License

Notifications You must be signed in to change notification settings

zeroesones/protext
protext: Simplified Japanese Text Processing

protext is a Python library for processing Japanese text. It provides a simple API for common natural language processing (NLP) tasks such as part-of-speech tagging, noun phrase extraction, sentiment analysis, classification, and translation.

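from protext import ProText

# Japanese text to analyze goes between the triple quotes.
text = """
"""

doc = ProText(text)
doc.tags  # part-of-speech tags

doc.noun_phrases  # extracted noun phrases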
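The snippet above can be wrapped in a small helper for use in scripts. The following is a minimal sketch, not part of protext itself: `summarize` is a hypothetical name, and it relies only on what this README shows (a `ProText` class with `tags` and `noun_phrases` attributes). It also assumes both attributes are iterable, which the README does not state.

```python
import importlib


def summarize(text):
    """Collect the tags and noun phrases protext reports for `text`.

    Hypothetical helper: it assumes only the `ProText` class and its
    `tags` and `noun_phrases` attributes shown in the README, and that
    both attributes can be iterated over.
    """
    # Import lazily so a missing install produces a clear error message.
    try:
        protext = importlib.import_module("protext")
    except ImportError as exc:
        raise RuntimeError(
            "protext is not installed; run `pip install protext` first"
        ) from exc

    doc = protext.ProText(text)
    return {
        "tags": list(doc.tags),
        "noun_phrases": list(doc.noun_phrases),
    }
```

Returning a plain dictionary keeps the result easy to print or serialize without depending on protext's own return types.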

Features

  • Noun phrase extraction
  • Part-of-speech tagging
  • Tokenization (splitting text into words and sentences)
  • Parsing
  • Word inflection (pluralization and singularization) and lemmatization
  • WordNet integration
  • Word embeddings
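To make the tokenization item concrete: Japanese is written without spaces, so sentence boundaries are usually found from punctuation rather than whitespace. The sketch below illustrates that idea with the standard library only. It is not protext's implementation, `split_sentences` is a hypothetical name, and real tokenizers handle many cases (quotations, ellipses, word segmentation) that this one ignores.

```python
import re

# A run of non-terminal characters followed by any sentence-final
# punctuation (full-width period, exclamation mark, question mark).
_SENTENCE = re.compile(r"[^。！？]+[。！？]*")


def split_sentences(text):
    """Naively split Japanese text into sentences on 。！？ (illustration only)."""
    sentences = (match.strip() for match in _SENTENCE.findall(text))
    # Drop fragments that were only whitespace, such as blank lines.
    return [sentence for sentence in sentences if sentence]


print(split_sentences("今日は晴れです。明日は雨ですか？"))
# → ['今日は晴れです。', '明日は雨ですか？']
```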

Installation

To install protext and download the corpora it uses, run:

$ pip install protext
$ python -m protext.download_corpora
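After running the commands above, one way to confirm that the package is visible to the current Python interpreter is to look it up without importing it. This is a generic standard-library check, not something protext provides, and `is_installed` is a hypothetical helper name. It does not verify that the corpora were downloaded.

```python
import importlib.util


def is_installed(package_name):
    """Return True if a top-level package can be found by this interpreter."""
    # find_spec returns None when a top-level module cannot be located.
    return importlib.util.find_spec(package_name) is not None


if __name__ == "__main__":
    print("protext installed:", is_installed("protext"))
```

If this prints `False` right after a successful `pip install`, the usual cause is that `pip` and `python` point at different environments.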

License

MIT licensed. See the bundled LICENSE file for more details.
