Skip to content
#

prompt-security

Here are 22 public repositories matching this topic...

A full-stack AI Red Teaming platform securing AI ecosystems via OpenClaw Security Scan, Agent Scan, Skills Scan, MCP scan, AI Infra scan and LLM jailbreak evaluation.

  • Updated May 29, 2026
  • Python

Semantic intent evaluation for agentic systems. Detects instruction-layer risks including authority laundering, consent bypass, concealed execution, and semantic attack patterns in SKILL.md and agent configuration files.

  • Updated May 11, 2026
  • Python

SentinelShield: Advanced AI content moderation combining Llama Prompt Guard 2, rule-based filtering, and real-time analysis. Protect your applications from harmful content, prompt injection attacks, and inappropriate material with sub-second response times.

  • Updated Aug 7, 2025
  • Python

MalPromptSentinel (MPS) is a Claude Code skill that detects malicious prompts in uploaded files before Claude processes them. It provides two-tier scanning to identify prompt injection attacks, role manipulation attempts, privilege escalation, and other adversarial techniques.

  • Updated Nov 27, 2025
  • Python

Improve this page

Add a description, image, and links to the prompt-security topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the prompt-security topic, visit your repo's landing page and select "manage topics."

Learn more