Speech To Text words details

merlino175 · November 11, 2023, 6:09pm

Hello everyone, is there any way to get more detail in the results of a transcript? I would need to know the exact time of each transcribed word/token, not whole sentences. Is this possible in any way?

nikola1jankovic · November 11, 2023, 6:33pm

Unfortunately, not via the API. You would need to run the open-source Whisper model yourself and combine it with an additional alignment model.

Check out WhisperX, although its word-level timecodes are not perfect.
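
For anyone landing here later, a rough sketch of the local route, not a definitive recipe. It assumes the open-source `openai-whisper` package, whose `transcribe` accepts `word_timestamps=True` and returns segments that each carry a `words` list with `word`, `start`, and `end` keys. The helper names `flatten_words` and `transcribe_with_words` are made up for illustration. WhisperX output has a similar per-segment `words` list as far as I know, but treat that as an assumption and check the actual result before relying on it.

```python
from typing import Any


def flatten_words(result: dict[str, Any]) -> list[tuple[str, float, float]]:
    """Collect (word, start, end) tuples from a segment-based transcript dict.

    Assumed shape: result["segments"][i]["words"][j] has "word", "start", "end".
    """
    words: list[tuple[str, float, float]] = []
    for segment in result.get("segments", []):
        for w in segment.get("words", []):
            # An aligner may leave out timings for tokens it could not align;
            # skip those entries instead of guessing a time.
            if "start" not in w or "end" not in w:
                continue
            words.append((w["word"].strip(), float(w["start"]), float(w["end"])))
    return words


def transcribe_with_words(path: str, model_name: str = "base") -> list[tuple[str, float, float]]:
    """Run open-source Whisper locally and return per-word timings (hypothetical helper)."""
    import whisper  # pip install openai-whisper; imported lazily so flatten_words works without it

    model = whisper.load_model(model_name)
    result = model.transcribe(path, word_timestamps=True)
    return flatten_words(result)
```

Calling `transcribe_with_words("audio.mp3")` (the file name is a placeholder) should give a list of `(word, start_seconds, end_seconds)` tuples. The timings are estimates, so expect some drift around pauses and overlapping speech.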
