voice agents

OpenAI adds GPT-5-class reasoning to real-time voice with modular models for orchestration

OpenAI’s new GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper aim to cut the heavy engineering overhead behind voice agents. Rather than cramming reasoning, transcription, and translation into one system, OpenAI routes each task to specialized models, letting enterprises orchestrate more cleanly within a 128K context window. The shift could make voice agents cheaper and easier to scale.

VentureBeat

Published by Beige on 8 May 2026

Summarised by Beige from a story on VentureBeat on 8 May 2026

OpenAI launches three real-time audio models, turning voice agents into multitaskers

OpenAI has unveiled three new audio models for developers, aimed at making voice agents faster, smarter, and more interactive in real time. GPT-Realtime-2 handles complex requests even when users interrupt. GPT-Realtime-Translate delivers live multilingual translation, while GPT-Realtime-Whisper provides instant speech-to-text for captions and notes. Early adopters include Zillow and Priceline.

The Economic Times

Published by Beige on 7 May 2026

Summarised by Beige from a story on The Economic Times on 7 May 2026
