Disable timestamps for Whisper?

Rrrila · May 10, 2023, 9:26am

Hi,

I have been searching all over the internet, including the official documentation of whisper, but I can’t find a way to disable timestamps on whisper transcripts. I’m using a colab.land project with the following line:

!whisper {input_path} --model large-v2 --language English --output_dir {output_folder} --output_format vtt

Can you help me on this? I’m not a developer myself, so I might have miss something. I have seen other hugging face projects where you can actually choose activate or deactivate timestamps for the output.

Thanks, Kind regards

gilman.outreach · May 10, 2023, 10:10am

Hello Rrrila,

I came to this forum seeking solution for the same issue. Found nothing yet, unfortunately. As far as I know, the Whisper library does not have a built-in option to disable timestamps for the transcripts. Well… you can always try manually removing the timestamps from the output file after running the command you mentioned. You can do this by opening the VTT file in a text editor and deleting the lines that contain the timestamp information.

Also, it would more reasonable to reachout to the developers of the Hugging Face library or the Whisper library directly seeking the solution (waiting for their response rn)

If I’ll be lucky enough to find the solution, I’ll make sure to post here

Rrrila · May 10, 2023, 10:18am

Hey!
Thanks for your answer, although, I know I can manually amend it (or using a script for it) that is not ideal solution, specially on languages such as Arabic, due to the fact that it repeats words many times on different time stamps, as translating to Arabic has too many complications, so removing timestamps will help on not to repeat words.

jayantyadav202 · May 26, 2024, 7:16am

Hi!
My motive was also to disable timestamps, but in hopes to get less halucinations. But OAI Whisper does not provide a way to disable it during inference. HF Whisper provides disabling through return_timestamps param in generate method, though only for short-form (<30 secs) clips.
I believe the reason it cannot be disabled for >30 sec clips is because a segment’s decoding depends on the timestamp predicted from its previous segment. Check section 4.5 of Whisper paper:

Regards,
Jay

Topic		Replies	Views
How can I get word_timestamp? API whisper	1	3066	December 14, 2023
How to get Whisper's API to add timestamps to the transcripts? API api , whisper	5	13742	January 29, 2024
Whisper API: a) Timecodes; b) how good is open-source vs API? API whisper	9	6104	July 28, 2023
Whisper API & Word-Level Time-stamping API whisper	6	18533	December 14, 2023
Can whisper give timestamps for every single word instead of every 5-10 words? API codex , whisper	3	3324	December 14, 2023

Disable timestamps for Whisper?

Related topics