Eslam-Ayman/NLP: Sentiment Analysis (Natural Language Processing)

Abstract

This sentiment analysis project predicts whether a movie review is positive or negative. We trained more than one model and compared them to find out which one performs best.

Data set

The dataset we used is labeled and consists of 2000 rows, each marked POS or NEG: http://boston.lti.cs.cmu.edu/classes/95-865-K/HW/HW3/

Pre-processing

A classification algorithm needs a feature vector for each document in order to perform the classification task. The simplest way to convert a corpus to vector form is the bag-of-words approach, in which each unique word in the text is represented by one number. First, we apply a function that removes punctuation and stopwords and returns a list of the remaining words, or tokens. Then we vectorize the tokens, converting each review into a vector.
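The two pre-processing steps can be sketched in plain Python. This is a minimal illustration, not the project's own code: the names `text_process`, `build_vocabulary`, and `vectorize`, the tiny stopword set, and the two sample reviews are all assumptions made for the example.

```python
import string
from collections import Counter

# Small illustrative stopword set (an assumption; a real run would likely
# use a fuller list from an NLP library).
STOPWORDS = {"a", "an", "and", "the", "is", "was", "it", "this", "of", "to"}


def text_process(review):
    """Remove punctuation and stopwords; return the remaining lowercase tokens."""
    no_punct = review.translate(str.maketrans("", "", string.punctuation))
    return [word for word in no_punct.lower().split() if word not in STOPWORDS]


def build_vocabulary(token_lists):
    """Give each unique token one integer index (sorted for reproducibility)."""
    unique = sorted({token for tokens in token_lists for token in tokens})
    return {token: index for index, token in enumerate(unique)}


def vectorize(tokens, vocabulary):
    """Turn a token list into a vector of word counts over the vocabulary."""
    counts = Counter(tokens)
    return [counts.get(token, 0) for token in vocabulary]


# Hypothetical sample reviews, not taken from the dataset.
reviews = ["This movie was great, great fun!", "The plot is dull."]
token_lists = [text_process(review) for review in reviews]
vocabulary = build_vocabulary(token_lists)
vectors = [vectorize(tokens, vocabulary) for tokens in token_lists]

print(token_lists[0])  # ['movie', 'great', 'great', 'fun']
print(vectors)         # [[0, 1, 2, 1, 0], [1, 0, 0, 0, 1]]
```

Each position in a vector counts one vocabulary word, so word order is discarded and only word frequency is kept.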

Methodology

Experiment 1 The first model we used is LinearSVC from the svm module of the sklearn library, which is a linear classifier. We tried it first with the default loss, squared_hinge, then with the other available loss, hinge, and we also changed the number of iterations.
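A minimal sketch of this experiment, assuming scikit-learn is installed. It uses `LinearSVC`, the scikit-learn linear SVM classifier that accepts the `squared_hinge` and `hinge` losses. The toy reviews, the helper `fit_svm`, and the iteration counts are assumptions for illustration and are not the project's settings.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.svm import LinearSVC

# Toy stand-in reviews (hypothetical, not taken from the project's dataset).
reviews = [
    "a wonderful moving film",
    "great acting and a great story",
    "a dull boring plot",
    "terrible acting and a boring script",
]
labels = ["POS", "POS", "NEG", "NEG"]

# Bag-of-words counts for each review.
X = CountVectorizer().fit_transform(reviews)


def fit_svm(loss, max_iter=1000):
    """Fit a linear SVM classifier with the given loss and iteration cap."""
    model = LinearSVC(loss=loss, max_iter=max_iter, random_state=0)
    return model.fit(X, labels)


# Default loss first, then the hinge loss with a larger iteration cap.
svm_squared_hinge = fit_svm("squared_hinge")
svm_hinge = fit_svm("hinge", max_iter=5000)

train_predictions = svm_hinge.predict(X)
print(list(train_predictions))
```

On a real run, the two variants would be compared on held-out reviews and not on the training data as done here.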

Experiment 2 The second model we used is MultinomialNB from the sklearn library, which is a Naive Bayes classifier. We first tried it with all the default parameters, then ran cross-validation on the dataset and changed the alpha parameter.
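The cross-validation step might look roughly like the sketch below, assuming scikit-learn is installed. The toy reviews, the helper `mean_cv_accuracy`, the candidate `alpha` values, and the fold count are all assumptions chosen to keep the example small.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Toy stand-in reviews (hypothetical, not taken from the project's dataset).
reviews = [
    "a wonderful and moving film",
    "great acting and a great story",
    "a funny charming film",
    "loved the clever story",
    "a dull and boring plot",
    "terrible acting and a boring script",
    "a weak tedious film",
    "hated the clumsy story",
]
labels = ["POS"] * 4 + ["NEG"] * 4


def mean_cv_accuracy(alpha, folds=2):
    """Mean cross-validated accuracy of Naive Bayes for one smoothing value."""
    # The vectorizer sits inside the pipeline so that each fold builds its
    # vocabulary from its own training split only.
    pipeline = make_pipeline(CountVectorizer(), MultinomialNB(alpha=alpha))
    scores = cross_val_score(pipeline, reviews, labels, cv=folds)
    return scores.mean()


# Try a few smoothing values; 1.0 is the scikit-learn default.
results = {alpha: mean_cv_accuracy(alpha) for alpha in (0.1, 0.5, 1.0)}
best_alpha = max(results, key=results.get)
print(results, best_alpha)
```

With only eight toy reviews the scores carry no meaning; the point is the shape of the loop over `alpha`.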

Experiment 3 The last model we used is LogisticRegression from the linear_model module of the sklearn library. This model differs from Naive Bayes in that its feature weights take dependence between features into account. We first tried it with the default parameters, then changed the number of iterations and the multi_class parameter.
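A minimal sketch of this experiment, assuming scikit-learn is installed. The toy reviews and the `max_iter` value are assumptions. Recent scikit-learn releases deprecate the `multi_class` argument, so the sketch leaves it at its default and only varies the iteration cap.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression

# Toy stand-in reviews (hypothetical, not taken from the project's dataset).
reviews = [
    "a wonderful moving film",
    "great acting and a great story",
    "a dull boring plot",
    "terrible acting and a boring script",
]
labels = ["POS", "POS", "NEG", "NEG"]

vectorizer = CountVectorizer()
X = vectorizer.fit_transform(reviews)

# Default parameters first, then a larger iteration cap for the solver.
default_model = LogisticRegression().fit(X, labels)
longer_model = LogisticRegression(max_iter=500).fit(X, labels)

# Score an unseen review; words outside the vocabulary are ignored.
new_review = vectorizer.transform(["a boring and dull film"])
predicted_label = longer_model.predict(new_review)[0]
class_probabilities = longer_model.predict_proba(new_review)[0]
print(predicted_label, class_probabilities)
```

The fitted `coef_` array holds one weight per vocabulary word, and all weights are estimated together, which is how the model can account for words that tend to occur together.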

Folders and files

.idea
README.md
SVM.py
finalized_SVM_model.sav
finalized_model.sav
finalized_naive_bayes_model.sav
logesticRegression.py
movie-pang02.csv
naiveBayes.py
naiveBayes_model.py
project
regression_model.py
