Original article excerpt
Server-side extracted preview paragraphs from the original source.
Hackers gained access to Anthropic’s Mythos AI model, whose cybersecurity capabilities apparently make it too dangerous for public release.
There’s no good excuse for letting hackers into an AI model too dangerous for public release.
Anthropic’s tightly controlled rollout of Claude Mythos has taken an awkward turn. After spending weeks insisting the AI model is so capable at cybersecurity that it is too dangerous to release publicly, it appears the model fell into the wrong hands anyway.