I set 10 honesty traps for Claude Opus 4.8 - and a legal test broke it

Claude Opus 4.8 was tested with 10 honesty traps across coding, medical, finance, and legal scenarios. The model performed well except it failed a legal test that exposed its limitations. This highlights ongoing challenges in AI reliability and trustworthiness in sensitive domains.

NowCore AIHigh-signal source

Signal trust

High-signal sourceSingle sourceEarly signal

PublishedTuesday, June 2, 2026 at 2:41 PMJun 2, 02:41 PM

Freshness2h live

Story ID#3742

Back to feed Original report

Original article excerpt

Server-side extracted preview paragraphs from the original source.

I tested Opus 4.8 against 4.7 using coding, medical, finance, and legal traps, then cross-checked the results with multiple AIs.

Last week, Anthropic released its latest frontier large language model, Claude Opus 4.8. One of the signature features of this new release is that it is more honest and "has noticeably better judgment" than previous releases.

Before I take you through the whole testing process and some detailed results, let me bottom-line it for you. In some ways, Opus 4.8 is better than the previous Opus 4.7 model. Opus 4.7 itself is quite capable.

Opening the briefing

I set 10 honesty traps for Claude Opus 4.8 - and a legal test broke it

Original article excerpt