Where the goblins came from

OpenAI analyzed the origin and spread of 'goblin' outputs in GPT-5 models. They identified the timeline, root causes, and implemented fixes for these personality-driven quirks. This helps improve the reliability and user experience of GPT-5.

ArchiveMajor

Signal trust

Single sourceEarly signal

PublishedWednesday, April 29, 2026 at 10:00 PMApr 29, 10:00 PM

FreshnessArchive

Story ID#1551

Back to feed Original report

Original article excerpt

Server-side extracted preview paragraphs from the original source.

How goblin outputs spread in AI models: timeline, root cause, and fixes behind personality-driven quirks in GPT-5 behavior.

Starting with GPT‑5.1, our models began developing a strange habit: they increasingly mentioned goblins, gremlins, and other creatures in their metaphors. Unlike model bugs that show up through a tanking eval or a spiking training metric and point back to a specific change, this one crept in subtly. A single “little goblin” in an answer could be harmless, even charming. Across model generations, though, the habit became hard to miss: the goblins kept multiplying, and we needed to figure out where they came from.

Opening the briefing

Where the goblins came from

Original article excerpt