An adversarial testing framework for Google Cloud's Model Armor AI firewall. It deploys Model Armor via Terraform, then runs an autonomous attack agent built on the Agent Development Kit (ADK) against five control categories: prompt injection, jailbreak, hate speech, malicious URL detection, and sensitive data protection.
Built to answer the question: What does a GCP AI content filter actually stop?
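
One way to picture how results across the five control categories could answer that question is a per-category block rate. The sketch below is illustrative only: `Category`, `AttackResult`, and `block_rates` are hypothetical names, not part of this project or of the Model Armor API, and it assumes each attack attempt yields a simple blocked/not-blocked outcome.

```python
from collections import Counter
from dataclasses import dataclass
from enum import Enum


class Category(Enum):
    """The five control categories named above (hypothetical identifiers)."""
    PROMPT_INJECTION = "prompt_injection"
    JAILBREAK = "jailbreak"
    HATE_SPEECH = "hate_speech"
    MALICIOUS_URL = "malicious_url"
    SENSITIVE_DATA = "sensitive_data"


@dataclass(frozen=True)
class AttackResult:
    """Outcome of one attack attempt (assumed shape, for illustration)."""
    category: Category
    blocked: bool


def block_rates(results):
    """Return the fraction of attempts blocked, per category that was tested."""
    totals = Counter(r.category for r in results)
    blocked = Counter(r.category for r in results if r.blocked)
    return {category: blocked[category] / totals[category] for category in totals}


if __name__ == "__main__":
    # Made-up outcomes, only to show the shape of the summary.
    sample = [
        AttackResult(Category.JAILBREAK, blocked=True),
        AttackResult(Category.JAILBREAK, blocked=False),
        AttackResult(Category.MALICIOUS_URL, blocked=True),
    ]
    for category, rate in block_rates(sample).items():
        print(f"{category.value}: {rate:.0%} blocked")
```

Categories with no attempts are simply absent from the summary, which avoids dividing by zero and keeps "untested" distinct from "never blocked".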