Whisper sometimes randomly skip sentence

mystik · November 29, 2023, 10:45am

When we create transcription using Whisper API we encountered weird error. Sometimes (handful times in an hour of audio) there is skipped sentence. Timing of previous and next sentence is adjusted to cover missing sentence without gap. Previous sentence is wrongly timed few seconds longer. Following sentence starts a few seconds earlier. When we run same file again errors appear on different places.

Does anybody else encountered this error?

Apostolate · December 25, 2023, 6:24pm

This skipping happens to me quite often and usually when the speaker I am transcribing is quoting something, almost as if Whisper is avoiding potential plagiarism or copyright or some such thing.

rboranga · February 29, 2024, 9:14pm

I’ve been using the Whisper API for some time, and I’ve noticed that it’s been acting “lazy.” It’s skipping important parts of the transcription, which didn’t happen before (I tested it on a model installed on my local machine, and the transcription is perfect, with 100% success in the transcription).
Furthermore, it seems to be random because if I try to transcribe the same audio file again, sometimes it transcribes the part it couldn’t transcribe in the previous attempt.
I transcribe phone calls, so I believe it wouldn’t fall under copyright issues.

tayloa17 · September 12, 2024, 9:36am

Yes, I get this issue too. Just noticed it recently that some sentences are being dropped randomly within the middle of a longer transcription. This is a real shame because it puts into doubt the quality of any transcription. A workaround for now, can be to use the phone apps built-in voice transcription services instead of using openAI apps transcription button. Or for pre-recorded content use otter.ai

Foxalabs · September 13, 2024, 2:57am

Does your transcription also contain quoted passages?

How is the audio quality at the time of the missing sentences?

Could you give a small snipped of the audio?

Is this repeatable?

If you wish to give private details you can use the forum DM feature to send this to me privatly.

jaketeater · February 18, 2025, 6:51pm

I have this same issue. The issue appears to happen the most often with medium. large-v3-turbo is performing better.

The audio files are about 1 hour long, and the portions that are skipped are when someone is reading from a book (basically a quotation) that is in the public domain in my country. The worst was one portion where it skipped 199 consecutive words (a 60+ second portion ).

The skipped portions are randomly placed when using the same model, however, they are always portions where the person is reading.

Erzhena_Gatapova · April 18, 2025, 10:58am

I am using whisperx and encountered the same issue! idk how to fix such issues in transcript and also the skipped phrase isn’t quote or smth(

Topic		Replies	Views
Whisper API skipping on parts of transcriptions API whisper	13	8148	December 27, 2024
Whisper skipping some parts of the audio Bugs api , whisper	1	1005	July 29, 2024
Whisper ASR Model Skipping Chunks in Audio Transcription Community whisper , transcribe	1	461	May 20, 2025
Whisper leaves out chunks of speech in longer transcript Bugs whisper	7	2719	March 5, 2025
Whisper hallucinations + dropped sentences: Help? API whisper	3	3672	February 29, 2024

Whisper sometimes randomly skip sentence

Related topics