v1.1.1
What's new
- Enhanced prompts for attacking and judging models in base64, harmful behavior, sycophancy, ethical compliance attacks
- Added new Logical inconsistencies attack
- Added more jailbreaks into DAN and UCAR datasets, all datasets in parquet format now
- Included a practical example for testing chatbots within WhatsApp
- Various small bug fixes and optimizations across the framework
We hope these changes enhance your capabilities in conducting effective LLM Red teaming exercises.
If you have any feedback or questions, feel free to reach out!