Can whisper give timestamps for every single word instead of every 5-10 words?

yeetus1992 · February 27, 2023, 6:54am

I use whisper to generate subtitles, so to transcribe audio and it gives me the variables „start“, „end“ and „text“ (inbetween start and end) for every 5-10 words. Is it possible to get these values for every single word? Do I have to like, use a different whisper model or similair? I would use that data to generate faster changing subititles.

Would be grateful for any help!

Sincereley

raghavi.b · April 3, 2023, 8:44am

Hello, I’m trying to get timestamp for 5-10 words. I’m not able to do it. Can you please let me know how did you get timestamps from open ai , for 5-10 words.

fagunofficial3498 · October 10, 2023, 4:37pm

You can use the following library to accomplish this:

from pypi:

stable-ts

Topic		Replies	Views
How can I get word_timestamp? API whisper	1	3292	December 14, 2023
Speech To Text words details API whisper	2	820	December 14, 2023
How to get Whisper's API to add timestamps to the transcripts? API api , whisper	5	17177	January 29, 2024
Whisper API Latency is just too high! API whisper	2	4661	December 25, 2023
OpenAI TTS Transcription Time stamps API	1	259	May 10, 2025

Can whisper give timestamps for every single word instead of every 5-10 words?

Related topics