Realtime Transcription Mismatch and gpt 4o transcribe latest

mcfinley · September 20, 2025, 10:42am

See the example session.update below. With this setup:

response.output_audio.delta and response.output_audio.done give you the audio stream from the AI
response.output_audio_transcript.done gives you the text transcript of what the AI said
conversation.item.input_audio_transcription.completed gives you the text transcript of what the user said
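
As a rough sketch of how those events could be routed on the client side: the handle_event and new_state helpers below are hypothetical names, not part of any SDK, and the payload fields they read ("delta" as base64-encoded audio, "transcript" as text) are my assumption about the event shapes, so check them against the events you actually receive.

```python
import base64
import json


def new_state():
    """Create an empty container for one conversation's audio and text."""
    return {
        "audio": bytearray(),   # decoded audio bytes from the AI
        "audio_done": False,    # set once the audio stream has finished
        "assistant_text": [],   # transcripts of what the AI said
        "user_text": [],        # transcripts of what the user said
    }


def handle_event(raw_message, state):
    """Route one server event (a JSON string) into `state` by its type."""
    event = json.loads(raw_message)
    event_type = event.get("type")

    if event_type == "response.output_audio.delta":
        # Assumption: "delta" carries a base64-encoded chunk of audio.
        state["audio"].extend(base64.b64decode(event["delta"]))
    elif event_type == "response.output_audio.done":
        state["audio_done"] = True
    elif event_type == "response.output_audio_transcript.done":
        # Assumption: "transcript" carries the full text of the AI's turn.
        state["assistant_text"].append(event["transcript"])
    elif event_type == "conversation.item.input_audio_transcription.completed":
        # Assumption: "transcript" carries the text of the user's turn.
        state["user_text"].append(event["transcript"])
    # Any other event type is ignored in this sketch.
    return state
```

You would call handle_event once per message read from the connection. Keeping the user and AI transcripts in separate lists makes it easier to compare them against the audio when the two do not match.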

session_update:

        session_update_message = {
            "type": "session.update",
            "session": {
                "type": "realtime",
                "model": "gpt-realtime",
                "audio": {
                    "input": {
                        "format": {          
                            "type": "audio/pcm",
                            "rate": 24000
                        },
                        "noise_reduction": {"type":"far_field"},
                        "transcription": {
                            "model": "gpt-4o-mini-transcribe"
                        },  
                        "turn_detection": {
                            "create_response": True,
                            "interrupt_response": False,
                            "prefix_padding_ms": 300,
                            "silence_duration_ms": 750,
                            "threshold": 0.5,
                            "type": "server_vad"
                        }
                    },
                    "output": {
                        "format": {
                            "type": "audio/pcm",
                            "rate": 24000
                        },
                        "speed":1,
                        "voice": "coral",
                    }
                },
                "instructions": "YOUR PROMPT HERE",
                "max_output_tokens": 1024,
                "output_modalities": ["audio"],
                "tool_choice": "auto",
                "tools": [],  # placeholder: add tool definitions here, if any
                "tracing": None,
                "truncation":"auto"
            }
        }
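
One thing to watch: the dict above uses Python literals (True, False, None), so it has to be serialized to JSON before it goes over the wire. A minimal sketch, assuming you send the message as a JSON text frame; to_wire is a hypothetical helper name and the trimmed-down message is only for illustration:

```python
import json


def to_wire(message):
    """Serialize a Python dict to the JSON text that gets sent."""
    return json.dumps(message)


# Trimmed-down stand-in for the full session_update_message above.
example_message = {
    "type": "session.update",
    "session": {
        "type": "realtime",
        "audio": {
            "input": {
                "turn_detection": {
                    "type": "server_vad",
                    "create_response": True,
                    "interrupt_response": False,
                }
            }
        },
        "tracing": None,
    },
}

# json.dumps turns True/False/None into true/false/null.
payload = to_wire(example_message)
```

Pass payload to whatever send call your websocket client provides.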
