I’m using the Realtime API with modalities: ["text", "audio"] and sending a session.update immediately after the data channel opens to confirm the modalities.
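For context, here's roughly what that looks like on my side (simplified sketch; the SDP exchange is omitted and the variable names are just my own):

```typescript
// Simplified sketch of my setup. The real code performs the SDP offer/answer
// exchange with the Realtime API endpoint; that part is omitted here.
const pc = new RTCPeerConnection();
const dc = pc.createDataChannel("oai-events");

dc.addEventListener("open", () => {
  // Sent immediately after the data channel opens, to confirm both modalities.
  dc.send(
    JSON.stringify({
      type: "session.update",
      session: {
        modalities: ["text", "audio"],
      },
    })
  );
});
```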
The session is created successfully, with both text and audio modalities confirmed in the payload, but during the session:
- I only receive audio events (response.audio.done, etc.).
- I never receive any response.text.delta, response.text.done, or response.output_item.added events containing assistant text.
- This happens even when the AI speaks full sentences, not just tiny utterances.
- There are no response.content_part.added or text delta events either.
I’ve checked everything on my end: the connection is healthy and stays open, and the session.update is acknowledged successfully.
Model used: gpt-4o-realtime-preview-2024-12-17. Prompts are simple and clean. This happens consistently across dozens of sessions.
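For reference, here's a simplified version of how I'm listening for events on the data channel (the helper name is just mine; error handling is omitted), which is how I can tell the text events never arrive:

```typescript
// Attach logging to the same data channel as in the sketch above.
function attachEventLogging(dc: RTCDataChannel) {
  dc.addEventListener("message", (e: MessageEvent) => {
    const event = JSON.parse(e.data);
    switch (event.type) {
      case "response.text.delta":
      case "response.text.done":
        console.log("text event:", event.type); // never fires in my sessions
        break;
      case "response.audio.delta":
      case "response.audio.done":
        console.log("audio event:", event.type); // fires as expected
        break;
      default:
        console.log("other event:", event.type);
    }
  });
}
```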
Questions:
- Is this a known issue?
- Are there any specific conditions under which the Realtime API would suppress text output entirely while streaming audio? For example, does function calling block assistant transcripts from coming in?