I had the same problem as everyone here, i.e. enabling the whisper-1 model and using g711_ulaw with Twilio did not produce any input audio transcript. In the end, I logged every event coming back from OpenAI inside the send_to_twilio function:
async for openai_message in openai_ws:
    response = json.loads(openai_message)
    print(f"Full event received: {json.dumps(response, indent=2)}")
After looking at the logs, I noticed the user audio transcription is generated when the transcription completes (the conversation.item.input_audio_transcription.completed event) and NOT when the audio is committed to the input_audio_buffer.
If anyone wants to try, here are the lines of code:
# The caller's transcript arrives only in this event type
if response.get('type') == 'conversation.item.input_audio_transcription.completed':
    transcription = response.get('transcript')
    if transcription:
        print(f"User said: {transcription}")