Best way to connect to open-ai realtime http then websocket or websocket first then update

lahiru · June 8, 2026, 8:27pm

Up until recently we were using openai direclty with only websocket,

Create the websocket connection
Update the connection with the prompt and voice etc

Lately I’ve faced a case where we get the following error, when this happen my whole prompt is lost and aI starts talking like, Hey whats in your mind. This is a disaster in our case.

invalid_request_error code=cannot_update_voice message=Cannot update a conversation’s voice if assistant audio is present.

This is because of a timing issue where openai successfully start sending audio before the update request goes in, isn’t it (this is what claude code is telling me)

Anyone experience this ?

VeitB · June 8, 2026, 8:54pm

At what point in the process are you calling response.create?

The goal is to separate the immutable setup from any later updates. If this happens in the wrong order, assistant audio may start before the configuration has been updated.

Right after creating the session, send the initial session.update with voice, instructions, audio formats, tools, turn detection, and any other required configuration.

Once you receive session.updated, call response.create. This is where your failure case may originate from.

Essentially, you want to prevent the conversation from starting until the session has received its initial update.

lahiru · June 8, 2026, 9:19pm

Thanks VeitB.

I do exactly that, but for some reason twilio send some audio before i update, this happend 3 times during my last 50 calls i used to test. And this is giong to happen during the demo for sure

Also while creating the session, I’m doing other stuff to pull the right prompt based on the caller data.

I’m thinking of using the http endpoint (that will add 100ms).

lahiru · June 8, 2026, 9:27pm

I tried with the http endpoint, it adds a few hundred miliseconds at the beginning, but I can avoid it, lets see what other issues it will bring. us.

VeitB · June 8, 2026, 10:01pm

Have you been logging the chain of events to make absolutely sure that session.updated with the voice and prompt is always received before sending response.create? Especially in the cases where the model output is a plain assistant reply.

If yes this could be a bug.

Topic		Replies	Views
OpenAI_RealTime_Questions API realtime , api-realtime , api-realtime-speech	1	320	February 20, 2025
Constantly disconnecting after session update with Realtime API Bugs realtime , api-realtime	7	2119	March 6, 2025
Realtime api phone use case - speaking text Feedback assistants-api , realtime	16	2247	November 5, 2024
SIP Trunking + Realtime API Call Flow — Initial Greeting Delay & Language Mismatch API realtime , api-realtime , gpt-realtime	4	300	November 18, 2025
Realtime Websocket Confusion API gpt-4	5	1424	September 10, 2025

Best way to connect to open-ai realtime http then websocket or websocket first then update

Related topics