Hi, I’ve made a little flask route that does a whisper/chatgpt workflow for audio I send for transcription. It’s been working overall but a weird response keeps coming in for a specific audio. It’s just a bit from a podcast on willpower. A self-help oriented clip, that no matter how many times I put it through returns a bizarre pattern -over and over again. Here’s the contents (I have a summary generated and the transcript)
Summary: The video highlights the services of Transcription Outsourcing, LLC. The company is repeatedly mentioned, emphasizing its role in providing transcription services.
Transcription:
Transcripts provided by Transcription Outsourcing, LLC. Transcripts provided by Transcription Outsourcing, LLC. Transcription Outsourcing, LLC. Transcription Outsourcing, LLC. Transcription Outsourcing, LLC. Transcription Outsourcing, LLC. Transcription Outsourcing, LLC. Transcription Outsourcing, LLC. Transcription Outsourcing, LLC. Transcription Outsourcing, LLC.
Obviously, this is not what’s going on at all in the audio, so it sent me looking for this company. I thought how can this be? That was a rabbit-hole move because its an actual company that does transcriptions. Who cares, right?
Then I found another post : (see audio-transcription-behaves-erratic/569684) from a year ago. Similar situation but not exactly. This post is explain they are seeing this companys name randomly within their transcripts. WTF is actually going on here?
Now, this happened three times with the specific file using my flask route. So i took the file converted it to another format and just submitted to whisper for a context-less, prompt-less transcription. It did in fact receive the correct transcript. I thought, “Oh, might have been a fluke maybe that company uses whisper all day as their bread and butter so the model is loaded with their name so if you have prompts in your whisper sends, it convolutes, or somehow interprets one of their rules to mark up your transcript.”
I know, this sounds bizarre but its the only working theory I have.
Figuring this was a whisper hallucination (see whisper-transcription-failures-and-hallucinations/705634) just for giggles I tried one more time while writing this post on my flask route. Keep in mind, I already got the transcript using a straight to whisper method with no prompt. But I wanted to test it out again to see if getting the correct transcript would happen using the route. Suprisingly, no, it just varied the summary promoting this company again. This is what was returned:
Summary: The video focuses on the frequent repetition of a company's name. A TikTok video humorously emphasizes the repetitive mention of "Transcription Outsourcing, LLC" multiple times.
Transcription:
Transcripts provided by Transcription Outsourcing, LLC. Transcripts provided by Transcription Outsourcing, LLC. Transcription Outsourcing, LLC. Transcription Outsourcing, LLC. Transcription Outsourcing, LLC. Transcription Outsourcing, LLC. Transcription Outsourcing, LLC. Transcription Outsourcing, LLC. Transcription Outsourcing, LLC.
Like a mantra or some ghost in the machine stuff. What do you think? If anyone wants the file to test I can provide it. Utterly baffled right now.