Hello everyone,
I’m currently working with the OpenAI Agents SDK, setting up a RealtimeAgent with WebRTC for voice interaction.
My setup works as expected with the primary RealtimeAgent: it respects my modalities configuration, and I consistently receive text output via response.text.delta and response.text.done events.
However, once the session hands off to the sub-agent, the modalities setting is ignored and the output is always audio.
Has anyone else encountered this specific issue where the handoff agent defaults to audio-only output and doesn’t emit text-related events, even when explicitly configured?
Thank you in advance for your help!
My setup is as follows:
const mainAgent = new RealtimeAgent({
  name: "Main Agent",
});

const subAgent = new RealtimeAgent({
  name: "Sub Agent",
});

mainAgent.handoffs = [subAgent];

session.current = new RealtimeSession(mainAgent, {
  model: "gpt-4o-mini-realtime-preview",
  config: {
    modalities: ["text"],
    inputAudioTranscription: {
      model: "whisper-1",
      language: "en",
    },
    turnDetection: {
      type: "semantic_vad",
      eagerness: "low",
      create_response: false,
      interrupt_response: false,
    },
  },
});
Events
session.current.transport.on("response.text.delta", (event) => {
  // Fires only while the primary agent is active
});

session.current.transport.on("response.text.done", (event) => {
  // Fires only while the primary agent is active
});