Evaluating chain-of-thought monitorability

OpenAI published research on evaluating the monitorability of chain-of-thought reasoning in AI models. This work helps understand how well AI systems can explain their reasoning steps. Improved monitorability is important for building more transparent and trustworthy AI.

ArchiveMajor

Signal trust

Single sourceEarly signal

PublishedThursday, December 18, 2025 at 1:00 PMDec 18, 01:00 PM

FreshnessArchive

Story ID#148

Back to feed Original report

Original article excerpt

Server-side extracted preview paragraphs from the original source.

OpenAI introduces a new framework and evaluation suite for chain-of-thought monitorability, covering 13 evaluations across 24 environments. Our findings show that monitoring a model’s internal reasoning is far more effective than monitoring outputs alone, offering a promising path toward scalable control as AI systems grow more capable.

We introduce evaluations for chain-of-thought monitorability and study how it scales with test-time compute, reinforcement learning, and pretraining.

Opening the briefing

Evaluating chain-of-thought monitorability

Original article excerpt