A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
Updated Feb 17, 2023
A comprehensive suite of high-level NLP tasks for Persian language
chakki's Aspect-Based Sentiment Analysis dataset
A collection of scripts to collect crypto market sentiment data
MBTI dataset, sentiment dataset, micro-emotion, Weibo sentiment dataset: a multi-label Chinese affective computing dataset covering personality traits with six emotions and micro-emotions, each annotated with intensity levels.
An automatically annotated sentiment analysis dataset of product reviews in Russian.
Encyclopedic Hub for Sentiment Dictionaries
Repo for Turkish movie reviews dataset.
A perceptron-based text classifier using bag-of-words feature extraction, applied to a sentiment analysis dataset.
Sentiment analysis for the Bangla language.
WRIME for Hugging Face Datasets
A Sentiment Analysis Dataset of Comments in Serbian
Scikit-Learn & Keras LSTM | Volunteer project for sentiment analysis of quarantine tweets | Python
Repo for Turkish sentiment analysis dataset, "Vitamins and Supplements Customer Reviews"
Fine-tuned DistilBERT pipeline for employee sentiment analysis. 30-day flight-risk model (R2=0.81, MSE=1.34). 10,000+ records. Dockerized. Springer Capital internship.
CI-Guided Data Curation: Using prediction instability to detect label noise. Validated on SST-2 with 10% noise injection. Recovered 40% of accuracy gap to perfect labels. Part of the Collapse Index framework.
Sentiment analysis on 50,000 IMDB reviews using TF-IDF, Word2Vec, and BERT
An end-to-end NLP pipeline for binary sentiment classification using the Rotten Tomatoes Sentence Polarity dataset.