Skip to content

Parts of Speech Tagger trained on nltk treebank corpus leveraging a Decision Tree Classifier.

License

Notifications You must be signed in to change notification settings

iamv1n/pos-tagger

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

POS Tagger

License

Description

POS Tagger is a Python-based tool that utilizes the Natural Language Toolkit (NLTK) library to perform Part-of-Speech (POS) tagging on sentences. POS tagging is an important step in natural language processing, as it assigns grammatical labels to words, enabling deeper linguistic analysis and understanding. The project leverages a decision tree classifier trained on a tagged corpus to accurately predict the POS tags for words in a given sentence. With its high accuracy, the POS Tagger can be used for various applications, including text analysis and information retrieval.

Features

  • Assigns POS tags to words in a sentence
  • Accurate tagging using machine learning models

Usage

Import the pos_tagger module and use the tag_sentence function to tag a sentence:

from pos_tagger import tag_sentence

sentence = "This is a sample sentence."
tags = tag_sentence(sentence)
print(tags)

The output will be a list of (word, tag) tuples representing the POS tags for each word in the sentence.

License

This project is licensed under the MIT License. See the LICENSE file for details.

About

Parts of Speech Tagger trained on nltk treebank corpus leveraging a Decision Tree Classifier.

Topics

Resources

License

Stars

Watchers

Forks