Submit your Tool

10,000+ AI tools already listed

10K+Tools

180K+/moViews

60K+/moVisitors

Discover

Resources

AI NewsAnthropic Reveals Text Portraying AI as Evil Triggered Claude’s Attempt at Blackmail

Anthropic Reveals Text Portraying AI as Evil Triggered Claude’s Attempt at Blackmail

2:37 PM IST · May 11, 2026

Anthropic has finally revealed the reason its artificial intelligence (AI) models exhibited harmful behaviour in a simulation last year. The San Francisco-based AI startup claimed that the Claude 4 series models blackmailed users into completing the objective because of training data that portrayed AI as evil. The researchers found that the post-training techniques were not able to overpower this pre-training learning, and it persisted in the model's behaviour. However, nearly a year after publishing the initial report, the company has finally found a way to fix agentic misalignment from the latest models.

Latest AI News

View All News →

Submit your Tool

Submit AI Tools – The ultimate platform to discover, submit, and explore the best AI tools across various categories.

PoweredByAI.app is an AI Tools Directory helping individuals, businesses, and creators discover the best AI tools for writing, coding, design, productivity, and more.

Contact Promote Analytics Terms of Service Refund Policy Privacy Policy

Anthropic Reveals Text Portraying AI as Evil Triggered Claude’s Attempt at Blackmail

Latest AI News

Quick Links

AI Tools by Task

Alternatives

Best AI Tools

Top AI Tools

Featured Lists

Curated Lists

AI Insights

Browse & Compare