Whisper large-v3 model vs large-v2 model

AyushSachan · November 30, 2023, 8:54am

I am currently working on a project where my objective is to transcribe audio calls from various languages into English. Until now, our application has been utilizing the large-v2 model, and we are considering migrating to the large-v3 model. However, upon testing both the large-v2 and large-v3 models on a set of 20 audio files, I observed that the large-v2 model generally produces better output compared to the large-v3 model, except in two instances where the large-v3 model performed better. Large-v2 transcripts are better by around 20 - 30%.

I am trying to understand if there’s something I might be overlooking. The large-v3 model is purported to be an improvement, yet in my experience, it seems to be the opposite.

For reference, I am using the code provided for the large-v3 model, which can be found here: huggingface[.]co/openai/whisper-large-v3.

amontanor · December 12, 2023, 10:36am

Hello @AyushSachan.

Have you managed to improve the results? The same thing is happening to me, I am getting better results with the V2 version.
I am also thinking about doing fine-tuning or pre-optimizing the audio.

I don’t know if you have managed to improve the results, if so, we could discuss the changes here.

AyushSachan · December 18, 2023, 6:53am

No, Im stil trying to figure it out how can i fix it.

cocoatouchg · April 18, 2024, 2:12pm

Has anyone done this test lately? Is Whisper v2 still considered better overall than Whisper v3? If that’s the case, it’s probably why OpenAI is still using Whisper v2 as the public API.

huynhnguyentoan · October 3, 2024, 3:20pm

I also get the same thing.
I tested on audio files, and it looks like large-v2 outputs the result better than large-v3.
Does anyone know why?

rumi2 · November 26, 2024, 5:21am

I think it depends on the quality of your input audio, V3 performs better in noisy environments.

Topic		Replies	Views
Why Whisper accuracy is lower when using whisper API than using OpenAI API? API api , whisper	3	4305	December 23, 2023
Whisper endpoint doesn't support the latest models? API	4	1435	February 13, 2024
How can I use the new whisper large-v3 model via API? API whisper	3	6605	March 6, 2024
Whisper hallucinations + dropped sentences: Help? API whisper	3	3096	February 29, 2024
I found whisper API perform better than colab run whisper large-v3 API api	0	601	February 26, 2024

Whisper large-v3 model vs large-v2 model

Related topics