NLPirate

This project contains two finalized classifiers, designed to determine the authenticity of a review.

initial_classifier.py is a classifier built based on the work of Myle Ott and team on review authenticity. final_classifier.py is our attempt to develop a classifier that more accurately classifies review authenticity.

Dependencies

Required for final_classifier.py:

pyenchant

Data

The data for our project is courtesy of Myle Ott, and can be found on his website.

Included is a data_transform.py script to transform the data out of its folder structure and into standardized_data.csv, a tab-delimited file. This does not need to be run, as the standardized_data.csv file is included in the repository.
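As a rough illustration of what such a transform involves, the sketch below walks a folder of review text files and writes one tab-delimited row per review. It is not the contents of data_transform.py: the function name flatten_reviews, the column names, and the assumption that each review's parent folder name encodes its class are all hypothetical.

```python
import csv
import os


def flatten_reviews(root, out_path):
    """Walk `root` and write one tab-delimited row per .txt file found.

    Hypothetical helper: the real data_transform.py may differ.
    Returns the number of review rows written.
    """
    rows = 0
    with open(out_path, "w", newline="", encoding="utf-8") as out:
        writer = csv.writer(out, delimiter="\t")
        # Column names are assumptions made for this sketch.
        writer.writerow(["label", "source_file", "text"])
        for dirpath, dirnames, filenames in os.walk(root):
            dirnames.sort()  # make the traversal order deterministic
            for name in sorted(filenames):
                if not name.endswith(".txt"):
                    continue
                path = os.path.join(dirpath, name)
                with open(path, encoding="utf-8") as handle:
                    # Collapse tabs and newlines so each review fits on one row.
                    text = " ".join(handle.read().split())
                # Assumption: the parent folder name carries the class label.
                label = os.path.basename(dirpath)
                writer.writerow([label, os.path.relpath(path, root), text])
                rows += 1
    return rows
```

Calling flatten_reviews("data", "out.tsv") would then produce a single file that can be loaded with csv.reader(..., delimiter="\t"); both paths here are placeholders.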

Execution

To run either of the classifiers, invoke it from the command line: python final_classifier.py. Each classifier uses the entirety of the dataset with k-fold cross-validation, then reports its results once classification finishes, along with the features that were most useful for the classification.
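For readers unfamiliar with k-fold cross-validation, the following is a minimal, standard-library sketch of the idea: the data is split into k folds, and each fold is held out once for testing while the rest is used for training. The names k_fold_splits, cross_validate, and train_fn are hypothetical and are not taken from the classifiers in this repository, which may shuffle or stratify the folds and may rely on a library for this step.

```python
def k_fold_splits(n_samples, k):
    """Yield (train_indices, test_indices) for k contiguous, unshuffled folds."""
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    base, extra = divmod(n_samples, k)
    start = 0
    for fold in range(k):
        # The first `extra` folds take one additional sample each.
        size = base + (1 if fold < extra else 0)
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        yield train, test
        start += size


def cross_validate(samples, labels, k, train_fn):
    """Return the accuracy on each held-out fold.

    train_fn(train_samples, train_labels) must return a callable that maps
    one sample to a predicted label (an assumption of this sketch).
    """
    scores = []
    for train, test in k_fold_splits(len(samples), k):
        predict = train_fn([samples[i] for i in train],
                           [labels[i] for i in train])
        correct = sum(predict(samples[i]) == labels[i] for i in test)
        scores.append(correct / len(test))
    return scores
```

Averaging the returned list gives a single accuracy estimate in which every review has been used for testing exactly once.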

Also included are two Python notebook files that make use of the visualization library ELI5. Running these from Jupyter allows you to investigate how particular reviews are classified, with the contributing features highlighted in the text and the feature weights shown for that review.

