OpenAI's new voice model brings GPT-5-level reasoning to real-time conversations

OpenAI has unveiled three new voice models—GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper—that offer advanced capabilities in real-time conversations. These models can now reason at a level matching that of GPT-5, translate across 70+ languages, and transcribe live speech.
This development significantly enhances the AI’s ability to understand complex human interactions, making it more versatile for applications ranging from customer service to language learning platforms.
With these new features, OpenAI is pushing the boundaries of what AI can achieve in real-time voice communication, potentially revolutionizing how we interact with technology and each other.

Originally published at the-decoder.com. Curated by AI Maestro.

Stay ahead of AI. Get the most important stories delivered to your inbox — no spam, no noise.

OpenAI’s new voice model brings GPT-5-level reasoning to real-time conversations