I’m integrating OpenAI’s realtime speech-to-speech API into a FreeSWITCH module. The problem I’m having is that OpenAI immediately closes the socket connection after the agent begins speaking. Specifically the websocket gets closed by the far end (openAI’s side). The agent is in the process of emitting “response.audio_transcript.delta” events, then poof the socket gets killed.
Oddly, this problem DOES NOT happen when I connect to Azure’s implementation of the realtime API.
I’ve looked through the docs but don’t find anything relating to why OpenAI would be closing the socket. Can someone from OpenAI please comment on what conditions will cause the server to close the socket connection?
Hello, I am currently working on AI intelligent outbound calls and plan to integrate Azure’s real-time API into FreeSWITCH. I would like to know how you integrated it—did you modify the underlying FreeSWITCH modules? Thank you for your guidance.