Original article excerpt
Server-side extracted preview paragraphs from the original source.
Explore new realtime voice models in the OpenAI API that can reason, translate, and transcribe speech, enabling more natural and intelligent voice experiences.
A new generation of realtime voice models that can reason, translate, and transcribe as people speak.
We’re introducing three audio models in the API that unlock a new class of voice apps for developers. With these models, developers can build voice experiences that feel more natural, respond more intelligently, and take action in real time: