A full-stack AI Red Teaming platform securing AI ecosystems via OpenClaw Security Scan, Agent Scan, Skills Scan, MCP scan, AI Infra scan and LLM jailbreak evaluation.
-
Updated
May 29, 2026 - Python
A full-stack AI Red Teaming platform securing AI ecosystems via OpenClaw Security Scan, Agent Scan, Skills Scan, MCP scan, AI Infra scan and LLM jailbreak evaluation.
The fastest Trust Layer for AI Agents
Leaderboard Comparing LLM Agent Security on System Prompt Leakage and Attack Probes
The LLM guardian kernel
🚀 Unofficial Node.js SDK for Prompt Security's Protection API.
Universal Prompt Security Standard (UPSS): A framework for externalizing, securing, and managing LLM prompts and genAI systems, inspired by and extending OWASP OPSS concepts for any organization or project.
Open-source prompt injection detector — 5 layers, 91.7% F1, ~27ms, offline, Apache 2.0
Mithra Scanner is an interactive API testing tool for prompt injection, refusal detection, and LLM security benchmarking. It supports YAML-based rule definitions, custom refusal lists, REST API integration, and provides detailed CLI output for security testing of language model endpoints.
LLM Penetration Testing Framework - Discover vulnerabilities in AI applications before attackers do. 100attacks + AI-powered adaptive mode.
Lightweight AI security framework for prompt validation, output scanning, risk scoring, sensitive data detection, and Zero Trust policy enforcement.
CloakPrompt is a CLI tool that redacts secrets (passwords, API keys, credentials, etc.) before sending data to AI models.
R package for LLM safety guardrails across prompts, outputs, RAG context, PII, secrets, and local Ollama/NLP workflows.
Static analysis CLI that scans codebases for LLM prompt-injection, data-exfiltration, jailbreak, and unsafe agent/tool vulnerabilities. Runs fully offline, integrates with CI/CD, and outputs console, JSON, and SARIF reports.
Single-context metacognitive security framework for LLM prompt injection defense
GitHub Action to catch risky, expensive AI prompts before merge.
Live AI security demo: MCP tool abuse attacks vs Prompt Security defense, side-by-side in real time
Nüwa (女娲): Self-evolving AI Agent Prompt Architect (自进化的AI智能体提示词架构师). Copy & Paste (复制即用). Generate custom agent prompts via XML (生成定制化提示词). Optimized for mainstream LLMs (适配主流大模型). Build your AI team (打造专属AI团队).
AI Prompt Security Middleware with real-time leak detection, sanitization, browser extension monitoring, and admin dashboard analytics.
🛡️ Enterprise-grade AI security framework protecting LLMs from prompt injection attacks using ML-powered detection
The enterprise AI governance and safety control plane for AI agent prompts
Add a description, image, and links to the prompt-security topic page so that developers can more easily learn about it.
To associate your repository with the prompt-security topic, visit your repo's landing page and select "manage topics."