Is the same!!!
I try with this one.
{
“audio”:{
“input”:{
“format”:“audio/pcmu”
},
“output”:{
“format”:“audio/pcmu”,
“voice”:“marin”
}
},
“instructions”:“You are a helpful IVR assistant. You can answer in English.\nGreet the caller warmly and stay on the line awaiting their response.\nDo not hang up or send a BYE unless the caller explicitly ends the conversation or the system instructs you to end it.\nGuide the caller with short, friendly questions and pause to listen after each response.\nKeep the call active and be patient while waiting for audio from the caller.”,
“model”:“gpt-realtime”,
“response”:{
“conversation”:[
{
“content”:[
{
“text”:“Hello, I’m your IVR assistant. How can I help?”,
“type”:“output_text”
},
{
“audio”:{
“format”:“wav”,
“transcript”:“Hello, I’m your IVR assistant. How can I help?”,
“voice”:“marin”
},
“type”:“output_audio”
}
],
“role”:“assistant”
}
],
“modalities”:[
“audio”
]
},
“type”:“realtime”
}
”We are sending the correct accept request (200 OK, with save_session => true and PCM audio), but the response body is empty and the call is immediately terminated with BYE. It appears Realtime sessions are not enabled for this account, so no session_id is returned. Without a session_id we can’t open a WebSocket or stream audio back, so the model says the initial greeting and closes the call right away.
Could you check whether session persistence is enabled for our project? If possible, please allow the session to be saved (returning session_id in the accept response). Once we receive session_id, we can build a WebSocket client to keep the conversation going instead of closing immediately.”
Please help us.
Right now the conversation is starting and immediately after that we get a BYE from OpenAI
Have you considered using Asterisk AudioSocket and integrating Asterisk with OpenAI’s Realtime WebSocket APIs?
That approach would let you stream audio directly from Asterisk to OpenAI over a persistent WebSocket connection, enabling low-latency bidirectional audio (STS) without adding unnecessary intermediaries. It can simplify the architecture and give you tighter control over session handling and media flow.
Curious to know if you’ve evaluated this setup or if there are specific constraints preventing you from using it.