GitHub - RiccardoRobb/Sentiment-analysis-firefox-extension: Tweet sentiment analysis

Problem

Analyzing and categorizing the sentiment of tweets.

Main goal: compare different approaches using different data sets to see which combination succeeds in associating the right sentiment with each tweet.

Secondary goal: create a firefox extension that associate at each tweet a sentiment value converted in an emoji:

Value	Emoji
0	☹️
1	😀

Firefox extension

Datasets

Sentiment140 | Kaggle

[238.8MB] ~ 1,600,000 tweets

Every tweet has been annotated (0 = negative, 2 = neutral, 4 = positive), this dataset contains 6 fields/column:
- target: polarity of the tweet (0 = negative, 2 = neutral, 4 = positive)
- ID: tweet ID
- date: date of the tweet
- flag: the query
- user: the user that tweeted
- text: the text of the tweet

Methods

Logistic regression

Allows to predict a binary outcome using the sigmoid function which outputs a probability between 0 (Negative) and 1 (Positive)

Support Vector Machines

Allows to plot labelled data as points in a multi-dimensional space, there will be present a decision boundary that divides the data points in Positive and Negative

Decision Tree

Use the dataset features to create Positive / Negative questions and continually split the dataset until you isolate all data points belonging to each class

Random Forest

A meta estimator that fits a number of decision tree classifiers on various sub-samples of the dataset and uses averaging to improve the predictive accuracy and control over-fitting

Evaluation framework

Precision
Recall
Accuracy
F1-Score

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
Evaluation		Evaluation
Extension		Extension
Models		Models
BigData_project.ipynb		BigData_project.ipynb
README.md		README.md
Sentiment140.zip		Sentiment140.zip
Slides.pdf		Slides.pdf
TextEmotion.zip		TextEmotion.zip

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Problem

Datasets

Methods

Logistic regression

Support Vector Machines

Decision Tree

Random Forest

Evaluation framework

About

Releases

Packages

Languages

RiccardoRobb/Sentiment-analysis-firefox-extension

Folders and files

Latest commit

History

Repository files navigation

Problem

Datasets

Methods

Logistic regression

Support Vector Machines

Decision Tree

Random Forest

Evaluation framework

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages