[ACL 2025] Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms
natural-language-processing artificial-intelligence safety steering-behaviors sta large-language-models controlled-generation model-editing knowledge-editing acl2025 easyedit2
-
Updated
Jun 4, 2025 - Python