What minimum bitrate should I use for whisper?

jan.b · April 25, 2023, 7:46pm

I’m transcribing files that are around 25 MB, sometimes slightly bigger. They are currently encoded at a bitrate of 128 kbps.

Instead of cutting the files into parts, I figured I might lower the bitrate instead. Or would that reduce the transcript quality?

On the open source whisper project, someone wrote that whisper internally downsamples audio to 16 kHz.

Is that the same for the whisper-1 model? Can’t find that in the docs.

If so, downsampling the input files couldn’t possibly harm the transcript quality—right?
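To put rough numbers on the size-versus-bitrate trade-off in the question, here is a minimal sketch of the arithmetic. The 25 MiB limit, the 2% container overhead, and the function names are assumptions for illustration, not documented values.

```python
# Rough size/bitrate arithmetic for constant-bitrate audio.
# ASSUMPTIONS: a 25 MiB upload limit and ~2% container overhead;
# neither figure comes from the docs, adjust them to your case.

LIMIT_BYTES = 25 * 1024 * 1024  # assumed upload limit


def duration_seconds(size_bytes: float, bitrate_kbps: float) -> float:
    """Approximate duration of a file of a given size at a given bitrate."""
    return size_bytes * 8 / (bitrate_kbps * 1000)


def size_bytes(duration_s: float, bitrate_kbps: float) -> float:
    """Approximate audio payload size for a given duration and bitrate."""
    return duration_s * bitrate_kbps * 1000 / 8


def max_bitrate_kbps(duration_s: float,
                     limit_bytes: int = LIMIT_BYTES,
                     overhead: float = 0.02) -> float:
    """Highest bitrate that keeps a recording under the size limit."""
    usable_bits = limit_bytes * 8 * (1 - overhead)
    return usable_bits / duration_s / 1000


if __name__ == "__main__":
    d = duration_seconds(LIMIT_BYTES, 128)
    print(f"25 MiB at 128 kbps is about {d / 60:.1f} minutes")
    print(f"Same audio at 32 kbps: {size_bytes(d, 32) / 2**20:.2f} MiB")
    print(f"One hour fits at up to {max_bitrate_kbps(3600):.0f} kbps")
```

Under these assumptions, a 25 MiB file at 128 kbps is roughly 27 minutes of audio, and re-encoding it at a quarter of the bitrate shrinks it to about a quarter of the size. Whether the transcript survives that is a separate question this arithmetic does not answer.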

jdc2106 · August 30, 2023, 8:53pm

I came for this answer as well; how low can we reduce the quality?

_j · August 31, 2023, 3:36am

If you can still understand it, the AI probably can also. Some of the training may also be on lossy compressed audio to match what you hear.

The exact levels of codec compression it tolerates aren’t clearly documented. For mp4/aac, the HE-AAC codec can be good for voice down to 16-24 kbps, before it starts to sound swishy.
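For anyone who wants to try a lower bitrate, here is a minimal sketch that builds an ffmpeg command for mono, 16 kHz, low-bitrate AAC. The helper name and the defaults are hypothetical. It uses ffmpeg's built-in `aac` encoder, which produces AAC-LC rather than HE-AAC (HE-AAC usually needs a build with `libfdk_aac`), so treat the 16-24 kbps figure above as a starting point to test by ear rather than a guarantee. The 16 kHz target assumes the hosted model resamples like the open source one is said to, which this thread doesn't confirm.

```python
import shlex


def build_ffmpeg_cmd(src: str, dst: str,
                     bitrate_kbps: int = 32,
                     sample_rate: int = 16000) -> list[str]:
    """Build (but do not run) an ffmpeg command that re-encodes speech.

    Hypothetical helper: the defaults are guesses, not recommended values.
    """
    return [
        "ffmpeg",
        "-i", src,                   # input file
        "-vn",                       # drop any video/cover-art stream
        "-ac", "1",                  # downmix to mono
        "-ar", str(sample_rate),     # resample (16 kHz assumed sufficient)
        "-c:a", "aac",               # built-in AAC-LC encoder, not HE-AAC
        "-b:a", f"{bitrate_kbps}k",  # target audio bitrate
        dst,
    ]


if __name__ == "__main__":
    cmd = build_ffmpeg_cmd("interview.mp3", "interview_small.m4a", 24)
    # Print a shell-safe version; pass `cmd` to subprocess.run to execute it.
    print(shlex.join(cmd))
```

Transcribing a short clip at a few bitrates and comparing the results against the original is probably the only reliable way to find where quality drops for your audio.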
