A fast accurate API for detecting NSFW images.
-
Updated
May 31, 2024 - Python
A fast accurate API for detecting NSFW images.
The Open Source Firewall for LLMs. A self-hosted gateway to secure and control AI applications with powerful guardrails.
Self-Supervised Euphemism Detection and Identification for Content Moderation, IEEE S&P (Oakland) 2021
Dataset and code implementation for the paper "Decoding the Underlying Meaning of Multimodal Hateful Memes" (IJCAI'23).
OpenSceneSense is a Python library that harnesses AI for advanced video analysis, offering customizable frame and audio insights for dynamic applications in media, education, and content moderation.
NudeDetect is a Python-based tool for detecting nudity and adult content in images. This project combines the capabilities of the NudeNet library, EasyOCR for text detection, and the Better Profanity library for identifying offensive language in text.
LyricMind is an AI-powered cognitive firewall that analyzes song lyrics to protect against unconscious absorption of harmful content. Using customizable frameworks and LLMs, it detects risky patterns and integrates with music players to trigger protective actions when thresholds are exceeded.
An ML driven NSFW moderator for Slack 🚨👮♂️🤖
Automatically convert audio and video files and live audio streams to text with AssemblyAI's Speech-to-Text APIs. Do more with Audio Intelligence - summarization, content moderation, topic detection, and more. Powered by cutting-edge AI models.
Content moderation papers @ ICWSM 2019 & AAAI 2020.
One package to moderate them all
The official code repo for HyperAgent for neural bandits and GPT-HyperAgent for content moderation.
NLP/ViT-driven bot for detection & moredation of inappropriate content in Telegram groups
Python SDK
一款针对 LLM 输入侧审查的精确逆向分析工具。自动定位 NewAPI、OneAPI 及任何实施基于字典规则进行 Prompt 过滤的 API 网关中的敏感关键词 | A precision reverse-engineering tool for LLM input censorship. Automatically pinpoints blocked keywords in NewAPI, OneAPI, and any API gateways enforcing dictionary-based prompt filtering.
Trying to make sense of the EU's DSA Transparency DB
🛡️ AI-powered Chinese cyberbullying detection with explainable AI, LINE Bot, GPU acceleration. 中文網路霸凌防治系統
A Python-based tool for detecting adult content in images using machine learning APIs and computer vision techniques. This tool is designed for content moderation, parental controls, and maintaining safe digital environments.
AI agent for automated content moderation of movies and books, employing Retrieval-Augmented Generation (RAG) and natural language processing Large Language Models to identify, discover, and summarize potentially concerning content for informed decision-making.
End-to-end moderation system for text & images. FastAPI backend with provider toggles (OpenAI & Google Perspective/Vision), PyTorch TorchScript prefilter microservice, policy.yaml rules, token-bucket rate limiting, in-memory caching, React/Vite frontend, training scripts, and Kubernetes manifests.
Add a description, image, and links to the content-moderation topic page so that developers can more easily learn about it.
To associate your repository with the content-moderation topic, visit your repo's landing page and select "manage topics."