Issue with Arabic Transcription in Whisper-Large-V3-Turbo ("نعم" → "Naah"/"Naahe")

razau314 · June 18, 2025, 7:36am

I’m using whisper-large-v3-turbo to transcribe voice inputs in both English and Arabic. However, I’m encountering an issue where the Arabic word “نعم” (which means “yes”) is consistently being transcribed incorrectly as “Naah” or “Naahe”.

Has anyone else experienced this behavior with Whisper? If so, what strategies or configurations have you found effective in improving transcription accuracy for short Arabic words like this?

Any insights or suggestions would be greatly appreciated.

aprendendo.next · June 18, 2025, 9:39am

You can try a different model.

While whisper has less truncation problems, other models can perform better for specific languages.

https://openai.com/index/introducing-our-next-generation-audio-models/

Also, all models perform significantly worse than usual if the audio is too short with only a single word.

razau314 · June 18, 2025, 11:40am

My priority is to use a free model for transcription. Can you suggest any?

aprendendo.next · June 18, 2025, 3:24pm

Nothing that comes to mind ATM, but I’ve heard some people do fine-tuning on whisper to improve performance.

Colm_Roche · June 20, 2025, 1:30pm

Thanks for flagging this. Can you please generate a HAR file as you experience this error and email those details to support@openai.com?

Mohammed_Abed · September 14, 2025, 3:25pm

I had the same issues, but when I played with the parameters of my voice activity detection algorithm and the transcribed chunks sizes, it got fixed. One more important thing is the temperature, when you reduce it to around 0.1, this issue happens less often as I experienced.

Good Luck!

Topic		Replies	Views
Incorrect Transcription - Arabic voice returns Hebrew text Bugs whisper	0	129	October 2, 2024
Issue with Whisper ASR: Incorrect Language Transcription for Malayalam, Nepali, Telugu, and Others Feedback	1	405	September 23, 2025
Need Help Improving Whisper API Accuracy for Short Words and Pronunciation Tasks API whisper	0	313	December 13, 2024
Whisper hallucinations + dropped sentences: Help? API whisper	3	4065	February 29, 2024
Whisper API for Hindi Speech to Text API whisper	3	1385	March 5, 2025

Issue with Arabic Transcription in Whisper-Large-V3-Turbo ("نعم" → "Naah"/"Naahe")

Related topics