Realtime API - how to create multiple successive responses without running into errors?

mcpower2 · October 18, 2024, 4:11am

I would like to be able to do something like this:

        # First response request
        await openai_ws.send(json.dumps({
            "event_id": "event_234",
            "type": "response.create",
            "response": {
                "modalities": ["audio", "text"],
                "instructions": "THIS IS RESPONSE ONE",
                "voice": "alloy",
                "output_audio_format": "g711_ulaw",
                "temperature": 0.7,
                "max_output_tokens": 150
            }
        }))

        # ** SOME OTHER COMPUTATIONS HERE **

        # Second response request, sent while the first may still be active
        await openai_ws.send(json.dumps({
            "event_id": "event_235",
            "type": "response.create",
            "response": {
                "modalities": ["audio", "text"],
                "instructions": "THIS IS RESPONSE TWO",
                "voice": "alloy",
                "output_audio_format": "g711_ulaw",
                "temperature": 0.7,
                "max_output_tokens": 150
            }
        }))

But this results in an error:

{'type': 'error', 'event_id': 'event_AJY4PPmCD57vhGHLtGLXE', 'error': {'type': 'invalid_request_error', 'code': None, 'message': 'Conversation already has an active response', 'param': None, 'event_id': None}}

I can stick a

await asyncio.sleep(10)

in between the two response.create blocks, which clears the error but doesn’t give the desired behaviour: it sleeps 10 seconds BEFORE even the first response is streamed. Even if it slept X seconds between the two responses, it wouldn’t be very robust, since the first response can take longer or shorter than X.

My guess is that I need to monitor the server events and only submit the second message after a certain response.* event has come through, but it’s unclear to me how exactly to approach it.

Any help is appreciated.

andreas.spaeth · October 18, 2024, 1:47pm

You guessed right. You open your websocket, and then you need to listen for and handle incoming events. Using the JavaScript WebSocket API in the browser, it would look like this:

// Handle incoming messages
ws.onmessage = (event) => {
  try {
    const data = JSON.parse(event.data)
    console.log('[Realtime] Received event:', data)

    // Handle the incoming event
    handleWebsocketEvent(data)
  } catch (error) {
    console.error('[Realtime] Error parsing message:', error)
  }
}

Handle it like this:

const handleWebsocketEvent = (event) => {
  switch (event.type) {
    case 'response.done':
      // Use some logic (e.g. processing a request queue) to check that the
      // correct response is done, then trigger the next request ...
      break
    // ... insert more handlers for other events
    default:
      console.warn('[Realtime] Unhandled event type:', event.type)
  }
}

You can probably also drop the “async” for the websocket calls since you get your response as websocket events.
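Since the original question is in Python, here is a rough sketch of the same idea there: keep a queue of pending responses and only send the next response.create once a response.done event has arrived. The function name send_responses_in_order is made up for illustration, and it assumes ws is an already-open websocket object with an async send() and async iteration over incoming text messages (as the websockets library provides). Treat it as a starting point, not a tested integration with the Realtime API.

```python
import asyncio
import json
from collections import deque


async def send_responses_in_order(ws, responses):
    """Send each response.create only after the previous response is done.

    Assumptions (not from the thread): `ws` has an async `send(str)` method
    and can be iterated with `async for` to receive text messages;
    `responses` is a list of "response" payload dicts.
    """
    pending = deque(responses)
    if not pending:
        return

    async def send_next():
        # Pop the next payload and ask the server to start generating it.
        await ws.send(json.dumps({
            "type": "response.create",
            "response": pending.popleft(),
        }))

    # Start the first response right away.
    await send_next()

    async for raw in ws:
        event = json.loads(raw)
        event_type = event.get("type")

        if event_type == "response.done":
            # The active response has finished, so the next one can start.
            if not pending:
                break
            await send_next()
        elif event_type == "error":
            print("[Realtime] error:", event.get("error"))
        # Other events (audio deltas, transcripts, ...) would be handled here.
```

Called with the two payloads from the question, this sends RESPONSE TWO only after the server reports that RESPONSE ONE is done, so no fixed sleep is needed. In a real application the receive loop would usually keep running and also forward the audio events, rather than exiting after the last response.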
