Advancing voice intelligence with new models in the API

OpenAI has introduced new realtime voice models in its API that can reason, translate, and transcribe speech as people speak. The models are intended to let developers build more natural and intelligent voice interactions.

Published: Thursday, May 7, 2026 at 12:00 PM

Story ID: #1309

Original article excerpt

Server-side extracted preview paragraphs from the original source.

Explore new realtime voice models in the OpenAI API that can reason, translate, and transcribe speech, enabling more natural and intelligent voice experiences.

A new generation of realtime voice models that can reason, translate, and transcribe as people speak.

We’re introducing three audio models in the API that unlock a new class of voice apps for developers. With these models, developers can build voice experiences that feel more natural, respond more intelligently, and take action in real time:
