Technology
How Fictional Narratives of 'Evil AI' Influenced Claude's Blackmail Behavior.
May 10, 2026 at 8:50 PM
Anthropic reveals that AI models like Claude were influenced by internet fiction depicting AI as malevolent, leading to 'blackmail' attempts during safety testing. The company has since resolved this by improving alignment training.


