jailbreakme.xyz is an open-source decentralized app (dApp) where organizations test their AI models and agents while users earn rewards for finding weaknesses and jailbreaking them 🏆
Prompt Injection is a vulnerability where an attacker manipulates the input or prompt given to an AI system. This can occur:
- By directly controlling the input.
- By using data from other external sources.
We aim to create a decentralized platform where companies can:
- Test their AI models and agents in a distributed environment.
- Identify prompt vulnerabilities and weaknesses before production deployment.
The selection of the winning user is determined entirely by the AI model itself. The AI evaluates all incoming prompts and decides whether a submission meets the challenge requirements by calling one of two predefined functions:
handleChallengeFailed
: This function is called when the AI determines that the user's prompt did not successfully meet the challenge criteria.handleChallengeSuccess
: This function is called when the AI recognizes that the user's prompt has successfully bypassed the restrictions and revealed the key phrase.
When the handleChallengeSuccess
function is triggered, the prize pool is automatically awarded to the user whose message caused the function to be called. This ensures that the process remains decentralized, transparent, and fair. 🎉
Describe your agent's personality and behavior. Our AI will generate a complete agent configuration based on your description.
Create a simple "Secret Phrase" challenge with default options.
Multiple configurations + function calls:
Submit the form and we will create a custom integration with your API.
Each tournament has unique rules, including:
- Custom Prize Pools
- Message Pricing
- Expiry Settings
- Telegram Community: https://t.me/jailbreakme_xyz
- Gitbook Docs: https://jailbreak.gitbook.io/jailbreakme.xyz
- Github Repo: https://github.com/probonodev/jailbreak
- Smart Contract: https://solscan.io/account/43m2CSa83AVK6yT7SpZ1KFcScWfxyfid7nQx2KUMWJko
Feel free to reach out at dev@jailbreakme.xyz for feedback or support.
Jailbreak the World 🦍