8x8 Integrates GPT Realtime 2 into AI Studio for Advanced Voice AI
- Tim Banting
- 3 days ago
- 2 min read
8x8 has announced the addition of OpenAI’s GPT Realtime 2 to its AI Studio platform, introducing "GPT-5-class" reasoning to production-ready voice agents.

The integration aims to improve the reliability of automated customer interactions by enhancing tool calling and transcription accuracy. The update is currently available in early access for 8x8 customers, allowing businesses to deploy more sophisticated voice agents capable of handling complex, multi-step workflows.
What: How 8x8 GPT Realtime 2 Voice AI Changes CX Automation
The customer experience (CX) market is increasingly shifting from simple chatbots to sophisticated voice-based AI capable of "reasoning" through live conversations. Traditionally, voice AI has struggled with "hallucinations" or failing to execute specific tasks like booking appointments or retrieving account data during a call.
By adopting GPT Realtime 2, 8x8 is moving into a new tier of capability comparable to GPT-5-level logic. This mirrors a broader industry trend where communications platform providers (CPaaS) are racing to integrate the latest LLM (Large Language Model) updates to reduce latency and improve the "human" feel of automated service.
The use of OpenAI’s Realtime-Whisper model specifically addresses the historical pain point of poor transcription, which often leads to downstream errors in AI task execution.
Capabilities & Limitations
Capabilities
Advanced Reasoning: Utilises GPT-5-class logic and a 128K context window to maintain coherence during long, complex customer service calls.
Granular Effort Control: Provides a "reasoning effort control" toggle, allowing administrators to choose between high-speed responses for simple queries or deeper "thinking" time for complex troubleshooting.
Reliable Tool Calling: Improved ability to execute external tasks such as account verification, inventory checks, and call routing with fewer technical failures.
Limitations
Trade-off in Latency: Selecting higher reasoning efforts for complex tasks can result in increased response times, potentially affecting the natural flow of conversation.
Opt-in Requirement: The upgrade is not automatic for existing production agents; users must manually update configurations via the agent editor to access the new features.
Early Availability: As the feature is currently in early availability, it may not yet be accessible to all global customers or fully optimised for all secondary languages.
Signals to Watch
Operational Latency: Will the added "reasoning effort" create a noticeable delay that frustrates customers used to instant human or legacy bot responses?
Legacy Model Retirement: When will 8x8 move GPT Realtime 2 from "early availability" to the default standard, and how will they manage the migration of legacy agents?
Data Privacy Compliance: As voice data is processed through OpenAI’s latest models, how will 8x8 maintain its "privacy-by-design" principles in highly regulated sectors like finance or healthcare?



Comments