How to use whisper to handle long video?

_j · November 28, 2023, 7:20am

It means you need to encode it to a voice format that doesn’t waste so much data.

Whisper is open source, meaning that it can be used or recoded by anyone, and is light enough to be run on a 4GB GPU or slowly on CPU.

A person could, for example, use cloud Google Tensor processor ASICs and transcribe 50x faster than OpenAI can.

Other varieties you can run on your own good hardware can offer by-the-word timestamps or be oriented towards video transcriptions.

OpenAI API runs whisper-v2-large, but could be v3-upgraded without you knowing, as the newly released model is the same size.

Topic		Replies	Views
How to transcribe long audio to srt file directly? API whisper	3	5265	December 16, 2023
Questions regarding transcribing long audios (>25MB) in Whisper API API api , whisper	8	12401	December 15, 2023
Send an hours worth of audio through Whisper using node.js API	7	914	December 11, 2023
How to write a Python script for the new version of OpenAI Whisper API? API api	0	2035	March 21, 2024
Whisper API: a) Timecodes; b) how good is open-source vs API? API whisper	9	6686	July 28, 2023