How Audio Speed Affects Transcription Accuracy: Benchmark Insights

Thanks for pointing it out man, I fuckin knew there was something wrong with mp3’s :heart:

This is exactly what happened:

  • started with wav, worked fine on my normal test file (~30s)
  • thought “I need more words for accurate WER”
  • found the files on wikipedia
  • got error because I was trying to send ~200mb in one go.
  • got lazy, didn’t want to implement consistent chunking.
  • changed file type to mp3 to reduce size :man_facepalming:

I then proceeded to check the file’s by listening to them, and concluded that it sounded fine to my human ears :rofl:

Ngl, that’s the funniest idea I’ve heard all day, I just have zero experience with autotuning stuff, could you point me in the right direction?

1 Like