Hi everyone, I am implementing the OpenAI Realtime API and have configured the session to include audio transcription using the following configuration:
input_audio_transcription: {
  model: "whisper-1"
}
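For reference, a minimal sketch of how a configuration like this is typically wrapped in a `session.update` client event before being sent over the Realtime connection. The helper name `build_session_update` is hypothetical, and the sketch only builds the JSON payload; it does not open a connection or reflect my exact code.

```python
import json


def build_session_update(transcription_model="whisper-1"):
    """Build a session.update payload that enables input audio transcription.

    Hypothetical helper for illustration; only the JSON text is produced here.
    """
    event = {
        "type": "session.update",
        "session": {
            # Same setting as in the configuration above.
            "input_audio_transcription": {"model": transcription_model},
        },
    }
    return json.dumps(event)


payload = build_session_update()
print(payload)
```

The resulting string would be sent as a single text message on the already-open connection.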
However, the audio input provided by the user does not generate a transcript. Instead, the transcript field is always null. Below is the response received from the API:
{
  "type": "conversation.item.created",
  "event_id": "event_AkR2BLE7l9oMUumIva3Ku",
  "previous_item_id": null,
  "item": {
    "id": "item_AkR29UqpepukIR4ioIUYO",
    "object": "realtime.item",
    "type": "message",
    "status": "completed",
    "role": "user",
    "content": [
      {
        "type": "input_audio",
        "transcript": null
      }
    ]
  }
}
So how can I get the user transcript from the Realtime API?
Can someone please help?
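One thing that may be worth checking: as far as I understand, input transcription runs asynchronously, so the transcript is not filled in on `conversation.item.created` but arrives later in a separate `conversation.item.input_audio_transcription.completed` server event that carries `item_id` and `transcript`. Below is a rough sketch, under that assumption, of matching the two events by item ID. The function name `collect_user_transcripts` and the sample transcript text are made up for illustration.

```python
import json

TRANSCRIPT_EVENT = "conversation.item.input_audio_transcription.completed"


def collect_user_transcripts(raw_events):
    """Return {item_id: transcript or None} from raw JSON server events."""
    transcripts = {}
    for raw in raw_events:
        event = json.loads(raw)
        event_type = event.get("type")
        if event_type == "conversation.item.created":
            item = event["item"]
            if item.get("role") == "user":
                # The transcript is still unknown at this point.
                transcripts.setdefault(item["id"], None)
        elif event_type == TRANSCRIPT_EVENT:
            # The transcript arrives later, keyed by the same item ID.
            transcripts[event["item_id"]] = event.get("transcript")
    return transcripts


# Sample events: the first mirrors the one above, the second is invented.
sample_events = [
    json.dumps({
        "type": "conversation.item.created",
        "item": {
            "id": "item_AkR29UqpepukIR4ioIUYO",
            "type": "message",
            "role": "user",
            "content": [{"type": "input_audio", "transcript": None}],
        },
    }),
    json.dumps({
        "type": TRANSCRIPT_EVENT,
        "item_id": "item_AkR29UqpepukIR4ioIUYO",
        "content_index": 0,
        "transcript": "example transcript text",  # made-up sample data
    }),
]

print(collect_user_transcripts(sample_events))
```

If no such event ever arrives, logging every incoming event type would at least show whether transcription is being attempted at all.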