This project was started because large communities interact across many different channels in Rocket Chat, which created a need for an optional automated moderation service for offensive content. An administrator can deploy it on channels at their sole discretion.
The model classifies an input image into two broad categories:
- NSFW (Not Safe for Work)
- SFW (Safe for Work)
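As a rough illustration of the two-category output, the sketch below turns a model's NSFW probability into one of the two labels. The function name `classify` and the default threshold of 0.5 are assumptions made for this example, not part of the project's code.

```python
def classify(nsfw_probability: float, threshold: float = 0.5) -> str:
    """Map an NSFW probability to one of the two broad categories.

    Both the function name and the 0.5 default threshold are hypothetical;
    an administrator would likely tune the threshold per channel.
    """
    if not 0.0 <= nsfw_probability <= 1.0:
        raise ValueError("nsfw_probability must be between 0 and 1")
    return "NSFW" if nsfw_probability >= threshold else "SFW"


if __name__ == "__main__":
    for p in (0.05, 0.5, 0.93):
        print(p, classify(p))
```

Raising the threshold makes the moderation more permissive, and lowering it makes it stricter.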
The data can be found below.
- https://www.kaggle.com/omeret/nsfw-nsafe and https://www.kaggle.com/omeret/nsfw-safe
-
- Nude
- Sexy
- Safe
-
- Female genitalia
- Male genitalia
- Breasts
- Alex's Dataset. For the classes animated and porn, we can scrape the data from here.
- Instagram Scrapper. For the class Semi Nude, I used this tool to scrape a few Instagram pages that regularly post arousing images of men and women.
- Dataset 3. I also encourage everyone to read the piece this data was scraped for!
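Since the data comes from several sources with different classes, it can help to check how many images each class holds after downloading. The sketch below assumes a hypothetical layout with one subfolder per class under a common root (for example `data/nude`, `data/safe`); neither the layout nor the function name `count_images_per_class` comes from the datasets themselves.

```python
from pathlib import Path

# File extensions treated as images (an assumption for this sketch).
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png", ".gif", ".bmp", ".webp"}


def count_images_per_class(root):
    """Return {class_name: image_count} for each subfolder of root."""
    counts = {}
    for class_dir in sorted(Path(root).iterdir()):
        if not class_dir.is_dir():
            continue  # skip stray files next to the class folders
        counts[class_dir.name] = sum(
            1
            for f in class_dir.rglob("*")
            if f.is_file() and f.suffix.lower() in IMAGE_SUFFIXES
        )
    return counts
```

A large imbalance between classes in the resulting counts is a sign that more data should be scraped for the smaller classes before training.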
The 2015 paper entitled "Applying deep learning to classify pornographic images and videos" applied various types of convnets for detecting pornography. The proposed architecture achieved 94.1% accuracy on the NPDI dataset, which contains 800 videos (400 porn, 200 non-porn "easy" and 200 non-porn "difficult"). More traditional computer vision methods achieved 90.9% accuracy. The proposed architecture also performs very well regarding the ROC curve.
-Source: StackExchange
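The quoted results are reported as accuracy and in terms of the ROC curve. As a minimal sketch of what those two measures compute for a binary NSFW/SFW classifier, the following uses plain Python; the function names and the toy labels and scores are made up for illustration and are unrelated to the NPDI results.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def roc_auc(y_true, scores):
    """Area under the ROC curve, computed as the probability that a
    random positive (label 1) scores higher than a random negative
    (label 0); ties count as half."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Toy data: 1 = NSFW, 0 = SFW (illustrative values only).
    y_true = [1, 1, 0, 0]
    scores = [0.9, 0.4, 0.6, 0.1]
    y_pred = [1 if s >= 0.5 else 0 for s in scores]
    print("accuracy:", accuracy(y_true, y_pred))
    print("ROC AUC:", roc_auc(y_true, scores))
```

Accuracy depends on the chosen threshold, while the ROC AUC summarizes ranking quality across all thresholds, which is why the two are usually reported together.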