Turn-by-turn detection of manipulation accumulation in multi-turn AI conversations. Built for the APART Research AI Manipulation Hackathon 2026.
ai-safety ai-research ai-alignment behavioral-analysis anthropic llm-evaluation manipulation-detection bloom-framework
Updated Jan 11, 2026 - Python