Original article excerpt
Server-side extracted preview paragraphs from the original source.
For the first time, developers can also instruct the text-to-speech model to speak in a specific way—for example, “talk like a sympathetic customer service agent”—unlocking a new level of customization for voice agents.
A new suite of audio models to power voice agents, now available to developers worldwide.
Update on August 28, 2025: We announced the general availability of the Realtime API. Learn more here.