Realtime api transcribe gibberish

Jakob_Berglund · November 4, 2025, 1:40pm

Hi,

Working on an app for a few months that are using the Realtime API over WebRTC and it has been working pretty well until today.

Now it starts transcribing the user’s voice as complete gibberish in some conversations.
I can have a conversation and everything works great and then end that call and start a new and then nothing works.

See attached screenshot. For every user line the user said something that made sense and this is what it is transcribed as.

We are using: gpt-realtime-2025-08-28 and gpt-4o-mini-transcribe.

Is anyone else having problems with this?

torkage · November 4, 2025, 2:06pm

Hello, I’m glad you posted this because I’m experiencing exactly the same problem while it worked fine until today : Using WebRTC with Realtime API, user’s transcriptions randomly start to make no sense. And when this kind of gibberish transcriptions occur, the assistant seems blocked and stop responding. I have no idea why so hoping that this is fixed rapidly.

Jakob_Berglund · November 4, 2025, 2:09pm

Alright, nice to not be alone…
For us it will respond to the gibberish, but that could just be based on different instructions.

We were planning to start testing with external users this week, I guess that will not happen…

Adam_M_Evans · November 4, 2025, 8:12pm

Experiencing the same issue with realtime transcriptions with webrtc. Was working pretty well until today. Really hope this gets fixed asap.

Jakob_Berglund · November 5, 2025, 7:55am

Would be amazing if OpenAI would give some indication that there is an error and that they are working on it. Still today the same issue exist.

Adam_M_Evans · November 5, 2025, 8:37am

I agree. Right now, the issue is intermittent, but it’s happening fairly consistently — roughly 1 out of every 5 connections. When a client connects via WebRTC, the stream either transcribes correctly for the entire connection duration, or from the very start it fails by either transcribing incorrectly (nothing close to the input) or not detecting/transcribing speech at all. After the first sentence of a new connection, I can already tell whether that session will produce a valid transcription or if it will fail for the duration of that connection. So I have to cancel and retry until we get a “good server” or something. And to re-iterate, we did not detect this issue prior to 04 November, on either the legacy Beta API or the GA (/calls) API. Also tried gpt-4o-mini-transcribe, and gpt-4o-transcribe with the same issue. Anyone has ideas how to raise this issue to OpenAI so we know there is at least visibility from their side on this issue?
Do we know if the same issue exists if we use websockets instead?

Jakob_Berglund · November 5, 2025, 8:51am

We have not tested Websocket. For WebRTC we have the exact same experience as you have.

Letam_Barinua · November 5, 2025, 8:00pm

I am experiencing this exact issue right now. What could be wrong, has anyone been able to figure this out?

Jakob_Berglund · November 6, 2025, 7:27am

Might work over Websockets?

antoine.ferrere · November 6, 2025, 8:15am

Yeah we faced similar issues on Tuesday afternoon CET 4th Nov and yesterday. Exactly like you mentioned. No audio coming through to assistant neither transcript.

OpenAI mentionned an issue on their side “resolved” in 1h. Seems it look way longer.
Apparently working again now. But not great as we were testing with clients.

See here - corrupted audio over WebRTC. For some sessions it was working, for other it was absolutely hopeless.

Jakob_Berglund · November 6, 2025, 8:32am

Yeah looks a bit better now. But I had problems a lot longer than 1 hour. Hopefully it is starting to be resolved now. I think they added that later because when I was looking there was no indication of any problems. So the time was just made up.

Letam_Barinua · November 6, 2025, 8:34am

I tried this morning, it worked for the first few exchanges then started responding out of context with lots of gebberish transcriptions. I tried using a different realtime model and switching from their openai/agents SDK to raw WebRTC implementation but none seem to be working so far.

I’m wondering maybe some regions are still affected after their fix.

Adam_M_Evans · November 6, 2025, 12:36pm

just providing more info:

openAI had a blip on 04 November about corrupted input audio. That would explain the behaviour we see. However the issue has not been fixed.

I can confirm transcription via websockets works fine, however, there are other considerations such as being recommended for server to server only due to api key management, and sending audio in the correct format etc.

so it definitely seems to be specific to their webrtc apis, and its been a few days with no improvement.

Jakob_Berglund · November 7, 2025, 8:44am

If you are still seeing issues, please reach out to support@openai.com to try to escalate this issue. You will probably just talk to a Bot, but who knows?

juberti · November 10, 2025, 7:27pm

If you’re still hitting this issue, could you post your call ID (from the Location header in the /v1/realtime/calls response)? That will help us understand what’s going on here.

Letam_Barinua · November 11, 2025, 12:58pm

Hi @juberti thanks for responding to this thread. Please can you check this session for me. The user input are mostly unrelated to the conversation. I am using the Realtime SDK openai/agent version 0.3.0

Here’s my call ID: rtc_u0_7517910fb54d4c91b3ef41924d5998be

Adam_M_Evans · November 11, 2025, 1:01pm

I launched the test app I typically use to reproduce this issue. Previously, the issue would occur at least once within about 10 attempts. Although I haven’t used WebRTC for the past few days, as of today I can no longer reproduce the issue, so it appears to be resolved for me, and i have not changed my testing method. I’m not sure when the resolution occurred.

OpenAI_Support · December 8, 2025, 9:03pm

Thanks for flagging this and for the details shared so far. Issues like this are often related to the input audio, e.g. a microphone issue or an unexpected audio sample rate, but I'm happy to investigate further on our side.

For anyone still affected, could you share a few recent session IDs where you ran into the problem? It would also be helpful to see the transcription output along with any available metadata.

amandiosena · December 9, 2025, 7:00pm

Hi. Problem persists. You can take a look at this req id from audio.transcriptions.create
req_4c23350313954671b280782949d49a7b
whisper looks fine, but 4o-transcribe and 4o-mini-transcribe are returning nonsense consistently (random text in random language).
Transcription(text=‘Šiandien mes tikimės išsinuomoti būtent šito buto.’, logprobs=None, usage=UsageTokens(input_tokens=69, output_tokens=22, total_tokens=91, type=‘tokens’, input_token_details=UsageTokensInputTokenDetails(audio_tokens=69, text_tokens=0)))

original audio was a common sample:
”The sun rises in the east and sets in the west. This simple fact has been observed by humans for thousands of years.”

OpenAI_Support · December 10, 2025, 6:50pm

Thanks for the req_id, always helpful.

We checked the server-side trace and the request itself completed normally, which usually points to an issue with the input audio rather than a backend error. Common causes we see with “gibberish” transcriptions are microphone issues, unexpected sample rates, channel mismatches, or audio that’s being corrupted before it reaches the API.

A couple things we suggest trying:

Experiment with the audio format (for example WAV PCM16, mono, 16kHz or 24kHz).
Double check the mic source and ensure the audio being sent is what you expect.
Use the tracing feature to listen to the audio as received server-side. This can help confirm whether the audio sounds correct before transcription.

If you can share what audio format and capture setup you’re using, or what you hear when listening to the traced audio, we'd be happy to investigate further!

Topic		Replies	Views
[BUG] Realtime API - Transcript returns other users' data and internal tokens (gpt-realtime-2025-08-28) Bugs api , api-realtime	7	63	December 22, 2025
RealTime API Transcription errors Bugs realtime	7	2253	January 9, 2025
[BUG?] SIP Realtime – distorted or missing audio (noise/static) with `gpt-realtime` and `gpt-realtime-mini` Bugs gpt-realtime	9	273	November 7, 2025
Buzzing/Static Noise from gpt-realtime (WebRTC) Bugs api-realtime , gpt-realtime	9	422	December 27, 2025
Realtime API: session update doesn't change input audio format Bugs realtime	25	3109	November 19, 2024

Realtime api transcribe gibberish

Related topics