Which loss function is used on Whisper model?

amitliron · May 2, 2023, 3:10pm

I read the article about Whisper model:

Robust Speech Recognition via Large-Scale Weak Supervision

They didn’t write which loss function did they used ?

It seem that they trained the model as classification task, so did they used cross-entropy loss ?

valentina3 · April 5, 2024, 8:43am

Hi!

I am writing my Master’s thesis about a Whisper related topic and need to discuss the loss functions used for training … I suspect it really is cross-entropy loss, but have you found some proof (other than forum blogs) by any chance?

Thanks!

Topic		Replies	Views
Loss functions and Optimizer for the funetuning of GPT 3.5 API gpt-35 , fine-tuning , api , documentation	0	488	January 11, 2024
Why Whisper accuracy is lower when using whisper API than using OpenAI API? API api , whisper	3	2006	December 23, 2023
Whisper medium WER does not decay for low resource language Community whisper	0	265	December 19, 2023
Audio-transcribe or Whisper API pricing query API whisper	4	1476	December 17, 2023
Whisper – Confidence Score API whisper	0	119	April 3, 2024

Which loss function is used on Whisper model?

Related Topics