RealTime API echo issue when on speaker phone

toddjyoung123 · December 23, 2025, 9:34pm

I think it is impossible to keep the AI from being interrupted when the caller's phone is on speakerphone. The intro starts when a caller phones in, and inevitably everyone tests on speakerphone; the AI picks up its own audio as echo, treats it as an interruption, and starts cutting out.

Has anyone figured this out? My use case is handling incoming service phone calls, so I can't control the caller's device at all.

Using a Twilio stream, Node, and the Realtime API.

chinmay1 · December 24, 2025, 12:37am

You can change the VAD settings, including switching to Semantic VAD.
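For anyone who wants to try that, here is a minimal sketch of a `session.update` event that switches turn detection to Semantic VAD. The helper name `buildSemanticVadUpdate` is hypothetical, and the flat `session.turn_detection` shape is assumed to match the one used elsewhere in this thread; the `"low"` eagerness value is only a starting point, not a known fix for speakerphone echo.

```javascript
// Hypothetical helper: builds a session.update event that selects Semantic VAD.
// Assumes the flat session shape used in this thread (session.turn_detection).
function buildSemanticVadUpdate(eagerness = "low") {
  return {
    type: "session.update",
    session: {
      turn_detection: {
        type: "semantic_vad",
        // "low" waits longer before deciding the caller has finished a turn.
        eagerness,
        create_response: true,
        interrupt_response: true,
      },
    },
  };
}
```

You would send `JSON.stringify(buildSemanticVadUpdate())` over the Realtime WebSocket once the session is open.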

toddjyoung123 · December 24, 2025, 2:34pm

Hi, thanks. Yes, I've played around with various VAD settings, but to no avail. I did find another mention of my exact issue in the forum; that poster gave up and went with ElevenLabs. That is probably what I'm going to do as well: they don't have the issue, and the voices are much better. It's a drag having got this far, but I think the fix needs to happen on OpenAI's end, at least for phone calls where you don't control the caller's phone.

chinmay1 · December 25, 2025, 3:54pm

We are trying Pipecat. Will report the results once we are done.

toddjyoung123 · December 25, 2025, 5:27pm

Hi, something I discovered that fixed my particular issue, which may be useful: the echo/interrupt problem only occurred on the initial greeting. I'd start to hear the greeting, then it would think it was being interrupted and would jump, etc. So I changed the code to ensure the greeting can't be interrupted, and that totally fixed the issue. I kept everything else the same.

session: {
  modalities: ["text", "audio"],
  instructions: buildInstructions(ctx),
  voice: normalizeVoice(ctx.voice),
  input_audio_format: "g711_ulaw",
  output_audio_format: "g711_ulaw",
  turn_detection: {
    type: "server_vad",
    threshold: 0.85,
    prefix_padding_ms: 300,
    silence_duration_ms: 500,
    // With create_response off, the app must send response.create itself.
    create_response: false,
    // With interrupt_response off, detected speech does not cancel the reply in progress.
    interrupt_response: false,
  },
},
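The config above stops the server from cancelling a response, but a typical Twilio bridge also reacts to `input_audio_buffer.speech_started` on its own side (for example by sending Twilio a `clear` message). A minimal sketch of gating that client-side handling until the greeting has actually finished playing is below. It is not the code from this post: `createGreetingGuard` and `GREETING_MARK` are hypothetical names, and it assumes you send a Twilio `mark` message right after the last greeting audio chunk, which Twilio echoes back once playback reaches it.

```javascript
// Sketch only: createGreetingGuard and GREETING_MARK are hypothetical names.
const GREETING_MARK = "greeting-end";

function createGreetingGuard() {
  let greetingPlayed = false;

  return {
    // Twilio "mark" message to send right after the last greeting audio chunk.
    // Twilio echoes it back once playback reaches that point.
    buildMark(streamSid) {
      return { event: "mark", streamSid, mark: { name: GREETING_MARK } };
    },

    // Feed every parsed Twilio stream message through this.
    onTwilioMessage(msg) {
      if (msg.event === "mark" && msg.mark && msg.mark.name === GREETING_MARK) {
        greetingPlayed = true;
      }
    },

    // Feed every parsed Realtime server event through this. Returns true only
    // when the event should be handled as a genuine caller interruption.
    shouldInterrupt(serverEvent) {
      return (
        greetingPlayed &&
        serverEvent.type === "input_audio_buffer.speech_started"
      );
    },
  };
}
```

In the message handlers, you would run your usual barge-in logic only when `guard.shouldInterrupt(event)` returns true, so echo picked up during the greeting is ignored while later interruptions still work.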
