- OpenAI has unveiled three new voice models—GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper—that offer advanced capabilities in real-time conversations. These models can now reason at a level matching that of GPT-5, translate across 70+ languages, and transcribe live speech.
- This development significantly enhances the AI’s ability to understand complex human interactions, making it more versatile for applications ranging from customer service to language learning platforms.
- With these new features, OpenAI is pushing the boundaries of what AI can achieve in real-time voice communication, potentially revolutionizing how we interact with technology and each other.
Originally published at the-decoder.com. Curated by AI Maestro.
Stay ahead of AI. Get the most important stories delivered to your inbox — no spam, no noise.

