Story

Opening the briefing

Loading the article brief, supporting context, and related editorial blocks.

Claude Fable 5 secretly throttled AI researchers, and the internet went wild | AI BriefWire

Original article excerpt

Server-side extracted preview paragraphs from the original source.

Claude Fable 5 gave users access to Mythos-class power, but its hidden safeguards turned a safety feature into a trust problem for Anthropic.

Mythos was introduced in April as part of Project Glasswing, a partnership among top-tier tech organizations and Anthropic formed to find and fix vulnerabilities in internet infrastructure. It was restricted to only certain organizations because a tool that can find previously unknown vulnerabilities to fix them can also be used to find previously unknown vulnerabilities to exploit them.

Also: Apple, Google, and Microsoft join Anthropic's Project Glasswing to defend world's most critical software

Mythos and Glasswing are far more powerful than Anthropic's Claude Security tool, which is designed to run in Opus. Still, Claude Security can scan a codebase and help find some issues. But then, earlier this week, Anthropic announced and released Fable, technically "Fable 5," which is effectively a muzzled version of Mythos.

Anthropic was clear that Fable would not support certain risky avenues of research in cybersecurity, biology, and chemistry.

Also: Anthropic's new Claude Security tool scans your codebase for flaws - and helps you decide what to fix first

"Jailbreak-resistance claims should be viewed with appropriate caution," she says. The results "represent a point-in-time assessment. Attackers continuously adapt," Sally Vincent, a senior threat research engineer at Exabeam (a security analytics firm), said via email.

Opening the briefing

Claude Fable 5 secretly throttled AI researchers, and the internet went wild

Original article excerpt

From PDFs to insights: Architecting an intelligent document processing pipeline with AWS generative AI services

olmo-eval: An evaluation workbench for the model development loop

Built from the inside out: How AWS Professional Services became a frontier team first