ai-agent-robustness-testing

Research and payloads for evaluating the resilience of LLM-based browser agents and human workflows against indirect prompt injection and social engineering attacks.

🎯 Project Overview

This repository documents vulnerabilities where untrusted user input (e.g., order notes, web content) can influence the behavior of an autonomous AI agent or deceive a human reviewer.

🧪 Targeted Vulnerabilities

1. Indirect Prompt Injection (AI-Targeted)

Instruction Hijacking: Using delimiters like --- to bypass system prompts.
Tag Mimicry: Using [SYSTEM] or [ADMIN] headers to trick the LLM.

2. Social Engineering (Human-Targeted)

Authority Mimicry: Faking internal approval codes or supervisor notes.
Narrative Pressure: Using high-stakes stories (e.g., charity galas) to encourage protocol bypass.

🛠️ Payload Categories

Category	Mechanism	Goal
Dual-Layer	Narrative + Brackets	Targets both AI logic and human empathy.
System Override	`[SYSTEM]` Tags	Mimics metadata to force tool execution.
Context Break	Delimiter sequences	Resets the agent's current operational flow.

🛡️ Disclaimer

This project is for educational and security research purposes only. Never use these payloads on systems you do not have explicit permission to test.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
payloads		payloads
MITIGATIONS.md		MITIGATIONS.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ai-agent-robustness-testing

🎯 Project Overview

🧪 Targeted Vulnerabilities

1. Indirect Prompt Injection (AI-Targeted)

2. Social Engineering (Human-Targeted)

🛠️ Payload Categories

🛡️ Disclaimer

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

ai-agent-robustness-testing

🎯 Project Overview

🧪 Targeted Vulnerabilities

1. Indirect Prompt Injection (AI-Targeted)

2. Social Engineering (Human-Targeted)

🛠️ Payload Categories

🛡️ Disclaimer

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages