Whisper API hallucinating on empty sections

id.luchkin · April 29, 2023, 9:56pm

I have some success fighting this issue just processing the file through ffmpeg with a silenceremove command before sending the file to Whisper. Something like this: ffmpeg --fflags +discardcorrupt -y -i <file_name> -ar 8000 -af silenceremove=start_periods=1:stop_periods=-1:start_threshold=-30dB:stop_threshold=-30dB:start_silence=2:stop_silence=2. You would probably change the -ar (the sample rate) and some silenceremove flags depending on your audio, for that you can refer to this page.

Topic		Replies	Views
'Transcription Outsourcing, LLC' repeated throughout whisper transcript API api , whisper , hallucinations , audio	18	729	October 5, 2024
Weird whisper transcription links to FEMA.gov Bugs whisper	4	794	June 24, 2024
Whisper hallucinations + dropped sentences: Help? API whisper	3	3485	February 29, 2024
Whisper spitting out gibberish when trying to transcribe API whisper	4	1091	June 14, 2024
Hallucination on audio with no speech API whisper	7	7516	December 25, 2023

Whisper API hallucinating on empty sections

Related topics