A systematic investigation into the state volatility, memory contradictions, and logical inconsistencies of Anthropic's Claude Sonnet 4 large language model.
-
Updated
Oct 7, 2025
A systematic investigation into the state volatility, memory contradictions, and logical inconsistencies of Anthropic's Claude Sonnet 4 large language model.
Independent, forensic-style audit of the publicly available Gemini 2.5 Pro model by Google. Examines meta-cognitive reasoning failures and self-evaluation behavior. Not affiliated with or endorsed by Google.
Add a description, image, and links to the adversarial-audit topic page so that developers can more easily learn about it.
To associate your repository with the adversarial-audit topic, visit your repo's landing page and select "manage topics."