Whisper AI doesn't transcribe

Sorry, I totally missed notifications from this forum. I’m sure you have probably long moved on from this issue.
But for what it’s worth, I would have first tried to cut the first 10 minutes (the part that didn’t transcribe) and tried to send that to the API by itself, just to see what would happen.

I’ve had issues with parts of the audio not being transcribed. This happens whenever the audio starts with an “aah” or “ooh” sound, or any other vocalization/exclamation that is not a word. The transcription then may skip a long portion of the audio after this.

The same happens also if such a non-word sound follows after a long pause, though not as often.

In any case, when dealing with an hour of audio, I would probably add some code to automatically cut it up and send it as segments, and then resend any segment there was an error.

Something else (even though I haven’t tried this myself) you can do is modify the speed of the audio to slow it down, before you send it to the API. You’d do this automatically with ffmpeg or something. I’ve seen some anecdotes stating that this increases accuracy

3 Likes