Whisper Transcription Questions

david13 · December 6, 2022, 2:01pm

Is Whisper still in beta? I don’t seem to be charged anything for using it at the moment.
In a brief audio I submitted, it missed a few lines in the middle. In those lines, I included Spanish while the rest was in English – is that why it skipped them? Or does it randomly skip stuff in general?
Is transcribing things that “aren’t allowed”, i.e., against the content rules, a problem? (For example, transcribing a romantic love scene that it would be against the rules to have it generate.)
Not a question, but I was impressed with how it put in quotation and punctuation marks. Other automated transcription options such as Dragon or Otter are nowhere near as good at doing this.

david13 · December 7, 2022, 12:01pm

I’ve been testing it some more and it keeps missing large chunks of the transcription. What it does transcribe is virtually perfect, but then it will miss a big chunk - maybe 30 seconds or a minute.

Is this a known bug? Is it being addressed? Outside of this, the quality of the transcription is phenomenal… but missing chunks makes it unfortunately unusable! I’ve tried other automated transcription programs like Dragon etc. but OpenAI’s transcription is significantly better. If only it were reliable!

(I’ve been using it in the Sandbox. Don’t know if there are other ways to access it.)

david13 · December 7, 2022, 5:22pm

And upon trying a longer transcription of some fiction I dictated… a disaster. Most of it simply didn’t appear.

Anyone know what the deal with transcription is? Is it “known” to be broken? Any chance of it working any time soon? The accuracy of the transcripts that it produces is OUTSTANDING… except for the absence of huge chunks of it!

ilianos1 · January 13, 2023, 3:05pm

As long as the moderators on OpenAI’s Discord server are still deciding about my suggestion to create a channel for Whisper over there (where the community is a lot more active), I have connected with a few people on Discord via PM to talk about Whisper. There are useful discussions on GitHub as well.

The main question is probably how you set your parameters. For example, if the transcription gets “stuck” somewhere in the middle (which reportedly happens often during longer transcriptions), you should set the parameter condition_on_previous_text to False.
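
For anyone following along, here is a minimal sketch of that setting, assuming the open-source `openai-whisper` Python package from the GitHub repo. The helper name `transcribe_without_context` and the file name are made up for illustration; `load_model` and `transcribe` are the package's own calls.

```python
def transcribe_without_context(model, audio_path: str) -> str:
    """Transcribe a file without conditioning each window on the previous text.

    `model` is expected to be whatever `whisper.load_model(...)` returns
    (assumption: the open-source openai-whisper package is installed).
    """
    result = model.transcribe(
        audio_path,
        # Each window is decoded without the previous window's text as a
        # prompt, which is the setting suggested above for "stuck" output.
        condition_on_previous_text=False,
    )
    return result["text"].strip()


# Usage (hypothetical file name; requires `pip install openai-whisper`):
#
#   import whisper
#   model = whisper.load_model("base")
#   print(transcribe_without_context(model, "dictation.wav"))
```

The trade-off, as far as I understand it, is that the text may be a little less consistent across window boundaries, in exchange for being less likely to get caught in a loop.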

curt.kennedy · January 13, 2023, 3:09pm

It will only transcribe the first 30 seconds of whatever waveform you give it. To overcome this, I had to break the file into 30-second chunks, feed each chunk separately for transcription, then stitch all the transcriptions together to get the final overall transcription.
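
A rough sketch of that chunk-and-stitch approach, assuming the input is an uncompressed WAV file and using only the standard library. The names `split_wav` and `transcribe_long` are hypothetical, and `transcribe_chunk` stands in for whatever call sends one chunk to the model.

```python
import io
import wave
from typing import Callable, List

CHUNK_SECONDS = 30  # window length described in the post above


def split_wav(wav_bytes: bytes, chunk_seconds: int = CHUNK_SECONDS) -> List[bytes]:
    """Split a WAV file into consecutive chunks, each a complete WAV file."""
    chunks: List[bytes] = []
    with wave.open(io.BytesIO(wav_bytes), "rb") as src:
        params = src.getparams()
        frames_per_chunk = src.getframerate() * chunk_seconds
        while True:
            frames = src.readframes(frames_per_chunk)
            if not frames:
                break
            buf = io.BytesIO()
            with wave.open(buf, "wb") as dst:
                # Copy channels, sample width and rate; the frame count in
                # the header is corrected when the writer is closed.
                dst.setparams(params)
                dst.writeframes(frames)
            chunks.append(buf.getvalue())
    return chunks


def transcribe_long(wav_bytes: bytes, transcribe_chunk: Callable[[bytes], str]) -> str:
    """Transcribe each chunk separately and join the pieces with spaces."""
    parts = [transcribe_chunk(chunk).strip() for chunk in split_wav(wav_bytes)]
    return " ".join(part for part in parts if part)
```

Note that cutting at fixed 30-second boundaries can split a word in half, so a real pipeline might prefer to cut at silences or overlap the chunks slightly.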

And yes it works great! I ditched AWS Transcribe and went with Whisper!

ilianos1 · January 15, 2023, 10:28pm

I’m not sure which model you’re using or where you’re hosting it. But the kind of workaround you describe shouldn’t normally be necessary.

curt.kennedy · January 15, 2023, 10:41pm

I am just using the HuggingFace version:

https://api-inference.huggingface.co/models/openai/whisper-medium.en

And calling it from an AWS Lambda function.
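
Roughly, a call like that could look like the sketch below, using only the standard library. This is an assumption-laden illustration rather than the exact code: the token is a placeholder, the helper names are hypothetical, and the response is assumed to be JSON with a `text` field, which is how the hosted endpoint behaved for speech models at the time.

```python
import json
import urllib.request

# Endpoint quoted in the post above.
API_URL = "https://api-inference.huggingface.co/models/openai/whisper-medium.en"


def build_request(audio_bytes: bytes, token: str) -> urllib.request.Request:
    """Build a POST request carrying the raw audio bytes as the body."""
    return urllib.request.Request(
        API_URL,
        data=audio_bytes,
        headers={"Authorization": f"Bearer {token}"},
        method="POST",
    )


def parse_response(body: bytes) -> str:
    """Extract the transcript (assumes a JSON body shaped like {"text": ...})."""
    return json.loads(body)["text"]


def transcribe_chunk(audio_bytes: bytes, token: str) -> str:
    """Send one audio chunk to the hosted model and return its transcript."""
    with urllib.request.urlopen(build_request(audio_bytes, token)) as resp:
        return parse_response(resp.read())
```

Inside a Lambda function, `transcribe_chunk` would be called once per audio chunk, with the token read from an environment variable rather than hard-coded.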

ilianos1 · January 15, 2023, 11:57pm

I suppose the one on HuggingFace was deliberately limited because it’s hosted there for free. Better stick to the instructions on GitHub and you shouldn’t have that problem.

curt.kennedy · January 16, 2023, 12:09am

I got it to work, but good to know.

7diwakarbasnet · March 13, 2024, 4:57am

Hi @curt.kennedy, how does Whisper transcribe words that are not in a language’s dictionary? Suppose my Chinese audio contains the word “cat”, which, let us assume, is not in the Chinese vocabulary. What transcription is produced for that word?
