OpenAI’s new GPT Realtime model is now live in Vapi’s dashboard and API.
We’ve been testing it ahead of launch and it’s a noticeable step forward for real-time, production-grade use cases. Conversations feel more natural, with sharper turn-taking and clearer audio quality.
What’s New in GPT Realtime
Compared to earlier real-time models, GPT Realtime delivers:
- Lower latency: The response time feels natural. It’s fast enough for real-time, production-grade use cases where back-and-forth conversation is critical.
- Sharper instruction following: Handles 3–5 turn transactional flows with reliability.
- Multilingual flexibility: Switches languages mid-sentence with high accuracy.
- Improved tool‑calling design: Prompt-based creation is more intuitive and powerful.
- Voice upgrades: Cedar delivers a warm, conversational tone and strong accent emulation; Marin adds clarity for structured communication.
- Structured data handling: Names, phone numbers, and emails are repeated back with natural pacing, without sounding robotic.
- Audio quality boost:: Clearer, crisper sound with fewer distortions compared to earlier versions.
Why This Matters
These improvements open up applications where timing, tone, and nuance matter.
Here are a few use cases already being explored with early adopters:
- Healthcare triage: Collecting and repeating structured data like names and insurance IDs, with tone and pacing that patients trust.
- Coaching and training: Real-time back-and-forth with natural pauses and encouragement instead of robotic interruptions.
- Customer support: Transactional flows such as rescheduling appointments or troubleshooting accounts, handled in 3 to 5 turns reliably.
- Global services: A user can start in English, switch to Spanish, and continue the conversation naturally.
Availability
The GPT Realtime model is available to all Vapi users now.
You can try it directly from the Vapi dashboard or integrate it in your agents via API.
We’re excited to see what you build with it.