I have an audio recording that contains no human speech; it’s actually the audio track of a video in which a woman is cleaning her kitchen. Surprisingly, the OpenAI audio transcription API produces a hallucinated transcription in Korean.
I was expecting the Whisper API to return an empty transcription for such audio, because I’m developing an application that must handle audio both with and without speech.
Any suggestions on how to overcome this problem?
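One workaround, if you request `response_format="verbose_json"`, is to filter on the per-segment confidence fields the API returns. A minimal sketch of that post-filter — the `no_speech_prob`/`avg_logprob` heuristic mirrors the one in the open-source Whisper decoder, and the threshold values here are guesses you would tune on your own data:

```python
# Sketch: drop Whisper segments that the model itself flags as likely
# non-speech. Assumes segments from response_format="verbose_json",
# each carrying "no_speech_prob", "avg_logprob", and "text" fields.
# Thresholds (0.6 / -1.0) follow the defaults in the open-source
# Whisper decoder but should be tuned for your audio.
def filter_hallucinations(segments, no_speech_max=0.6, logprob_min=-1.0):
    kept = []
    for seg in segments:
        if seg["no_speech_prob"] > no_speech_max and seg["avg_logprob"] < logprob_min:
            continue  # high no-speech probability AND low confidence: likely hallucinated
        kept.append(seg["text"])
    return " ".join(kept).strip()


# Usage with dummy segment data shaped like the verbose_json response:
segments = [
    {"no_speech_prob": 0.92, "avg_logprob": -1.6, "text": "hallucinated text"},
    {"no_speech_prob": 0.05, "avg_logprob": -0.3, "text": "real speech"},
]
print(filter_hallucinations(segments))  # → "real speech"
```

This doesn’t stop the model from hallucinating, but it gives your application a principled way to discard segments the model itself considered likely non-speech.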
You might look into the work Nvidia did with the RTX series cards on detecting and isolating speech; it’s actually a non-trivial problem. AIs will always try to find the best match for the given input, and unless that input is pure silence there is always some probability of a false detection.
When I input a short English speech WAV to the OpenAI Whisper API, it occasionally returns a Korean translation of my English speech, though the content and meaning seem mostly correct. Right meaning, wrong language.
I am not a Korean speaker, so I don’t think any of my settings point to Korean. What could be wrong?
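If you know the input is always English, you can pin the decode language instead of letting Whisper auto-detect it, via the transcription endpoint’s `language` parameter (an ISO-639-1 code). A minimal sketch, assuming the official `openai` Python client — the helper just builds the keyword arguments so the call itself stays explicit:

```python
# Sketch: build kwargs for client.audio.transcriptions.create so the
# decode language is pinned to English rather than auto-detected.
# "language" takes an ISO-639-1 code; "prompt" is an optional nudge
# toward English output. The file name and prompt text are examples.
def transcription_kwargs(file_obj, language="en"):
    return {
        "model": "whisper-1",
        "file": file_obj,
        "language": language,  # disable language auto-detection
        "prompt": "The following is English speech.",
    }


# Usage (requires the `openai` package and an API key):
#   from openai import OpenAI
#   client = OpenAI()
#   with open("speech.wav", "rb") as f:
#       transcript = client.audio.transcriptions.create(**transcription_kwargs(f))
#   print(transcript.text)
```

With `language` unset, Whisper guesses the language from the first seconds of audio, and a wrong guess can flip the whole output into another language while preserving the meaning, which matches the symptom you describe.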