OpenAI’s new GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper aim to cut the heavy engineering overhead behind voice agents. Rather than cramming reasoning, transcription, and translation into one system, OpenAI routes each task to specialized models, letting enterprises orchestrate more cleanly within a 128K context window. The shift could make voice agents cheaper and easier to scale.
Swipe through stories, personalise your feed, and save articles for later — all on the app.