===========================================================
Documentation for NaijaSenti v.1.0
===========================================================
Contents:
- Introduction
- Data
- Contact Information
- References
- Introduction
NaijaSenti is an open-source sentiment and emotion corpora for four major Nigerian languages. This project was supported by lacuna-fund initiatives.
For more information, visit the project Github page here: https://github.com/hausanlp/NaijaSenti
- Data
NaijaSenti Datasets consists of the following data
-
Manually Annotated Twitter Sentiment Dataset
-
Manually Annotated Sentiment Lexicon
-
Semi-automatically Translated emotion lexicon
-
Semi-automatically Translated sentiment lexicon
-
Large Scale Unlabled Twitter Sentiment Corpus
-
Stop-words for Hausa, Igbo, Pidgin and Yoruba
- Contact Information
In case you have any questions regarding NaijaSenti, please contact Shamsuddeen Muhammad for further information. email: shamsuddeen2004@gmail.com
- References
If you plan to use the dataset for research or academic purposes, please cite the following publication.
@misc{muhammad2022naijasenti, title={NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis}, author={Shamsuddeen Hassan Muhammad and David Ifeoluwa Adelani and Sebastian Ruder and Ibrahim Said Ahmad and Idris Abdulmumin and Bello Shehu Bello and Monojit Choudhury and Chris Chinenye Emezue and Saheed Salahudeen Abdullahi and Anuoluwapo Aremu and Alipio Jeorge and Pavel Brazdil}, year={2022}, eprint={2201.08277}, archivePrefix={arXiv}, primaryClass={cs.CL} }
version 1.0 last modified 04/19/2022