Whisper AI doesn't transcribe

Bai_Lan_Blues · September 25, 2024, 7:48pm

Sorry, I totally missed notifications from this forum. I’m sure you have probably long moved on from this issue.
But for what it’s worth, I would have first tried to cut the first 10 minutes (the part that didn’t transcribe) and tried to send that to the API by itself, just to see what would happen.

I’ve had issues with parts of the audio not being transcribed. This happens whenever the audio starts with an “aah” or “ooh” sound, or any other vocalization/exclamation that is not a word. The transcription then may skip a long portion of the audio after this.

The same happens also if such a non-word sound follows after a long pause, though not as often.

In any case, when dealing with an hour of audio, I would probably add some code to automatically cut it up and send it as segments, and then resend any segment there was an error.

Something else (even though I haven’t tried this myself) you can do is modify the speed of the audio to slow it down, before you send it to the API. You’d do this automatically with ffmpeg or something. I’ve seen some anecdotes stating that this increases accuracy

Topic		Replies	Views
Whisper leaves out chunks of speech in longer transcript Bugs whisper	7	2333	March 5, 2025
Whisper api, not transcrip all audio API whisper	3	2068	October 28, 2023
Dialog before long pause gets repeated over and over again by Whisper API whisper	3	2198	November 6, 2023
Whisper API skipping on parts of transcriptions API whisper	13	7673	December 27, 2024
Whisper skipping some parts of the audio Bugs api , whisper	1	946	July 29, 2024

Whisper AI doesn't transcribe

Related topics