OpenAI's new voice model brings GPT-5-level reasoning to real-time conversations

OpenAI has introduced three new voice models, GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper, that can reason in real time, translate across 70+ languages, and transcribe live speech. The standout is GPT-Realtime-2, which OpenAI says reasons at a GPT-5 level.
These models mark a significant step for voice interaction and could change how people communicate with machines in real time.
The release underscores OpenAI’s push to make its conversational interfaces more capable and intelligent.

Originally published at the-decoder.com. Curated by AI Maestro.

