A web-based, lexicon-based sentiment analysis system that helps the analyst pull tweets from Twitter and analyse their sentiment. The system is built mainly with the following technologies:
- Flask
- NLTK (Tokenization)
- NumPy
- Tweepy
- regex
- OrderedDict
- MySQL Connector
- Emoji library
- webbrowser
- urllib
The system uses the SentiStrength algorithm to analyse tweets with the help of the following lexicons:
- Positive/Negative lexicon (with translated emoji)
- Booster words lexicon (sangat, sungguh, amat, etc.)
- Question words lexicon
- Negating lexicon
- Idiom lexicon
- Emoticon lexicon
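
To illustrate how several of these lexicons can work together, here is a minimal sketch of SentiStrength-style scoring. It is not this project's implementation: the function name `score_tweet`, the toy lexicon entries and their weights, and the simple "negator flips the sign" rule are all assumptions made for the example. Only the sentiment, booster, and negating lexicons are covered; idioms, emoticons, and question words are left out. As in SentiStrength, the result is a pair of scores, a positive strength from 1 to 5 and a negative strength from -1 to -5.

```python
import re

# Hypothetical toy lexicons for illustration only; the project's real
# lexicon files and weights are not reproduced here.
SENTIMENT = {"bagus": 3, "suka": 2, "buruk": -3, "benci": -4}
BOOSTERS = {"sangat": 1, "sungguh": 1, "amat": 1}
NEGATORS = {"tidak", "bukan"}


def score_tweet(text):
    """Return (positive, negative) strengths in the ranges 1..5 and -1..-5."""
    tokens = re.findall(r"\w+", text.lower())
    positive, negative = 1, -1  # neutral baseline for both scales

    for i, token in enumerate(tokens):
        if token not in SENTIMENT:
            continue
        strength = SENTIMENT[token]
        prev = tokens[i - 1] if i > 0 else None
        prev2 = tokens[i - 2] if i > 1 else None

        if prev in BOOSTERS:
            # A booster directly before the word increases its magnitude.
            boost = BOOSTERS[prev]
            strength += boost if strength > 0 else -boost
            negation_candidate = prev2
        else:
            negation_candidate = prev

        # Assumed simplification: a negator flips the polarity of the word.
        if negation_candidate in NEGATORS:
            strength = -strength

        strength = max(-5, min(5, strength))
        # Keep the strongest positive and the strongest negative word seen.
        positive = max(positive, strength)
        negative = min(negative, strength)

    return positive, negative


print(score_tweet("filem ini sangat bagus"))  # (4, -1)
print(score_tweet("saya tidak suka"))         # (1, -2)
```

Reporting the two strengths separately, rather than a single sum, lets a tweet that contains both strong praise and strong criticism be recognised as mixed instead of cancelling out to neutral.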