Running Whisper on AWS GPU - Memory Error

I am running Whisper on an AWS EC2 g3s.xlarge instance. I have a number of long (~1 hour) audio files and want to use the Whisper Medium model to transcribe them. My code works fine for the first file and then crashes with the following error message:

torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 16.00 MiB (GPU 0; 7.43 GiB total capacity; 6.72 GiB already allocated; 15.44 MiB free; 6.74 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

Does anyone know how I can handle this?
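
The error itself mentions fragmentation and PYTORCH_CUDA_ALLOC_CONF, so I am wondering whether explicitly freeing memory between files is the right direction. This is just a rough sketch of what I have in mind, assuming a plain Python loop over audio_filepaths; I have not tested it yet:

import gc
import os

# The error message suggests this allocator setting; it has to be set before
# CUDA is initialized (128 MB here is just an assumed starting value).
os.environ.setdefault("PYTORCH_CUDA_ALLOC_CONF", "max_split_size_mb:128")

import torch
import whisper

model = whisper.load_model("medium")

for path in audio_filepaths:
    result = model.transcribe(path, fp16=False)
    # ... write result["text"] to disk here ...

    # Drop the reference to the large result dict and release cached
    # GPU blocks back to the driver before the next file.
    del result
    gc.collect()
    torch.cuda.empty_cache()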

Use the API and ditch the EC2 instance!

This is how I am using it now:

import whisper

model = whisper.load_model("medium")
whisper_results = model.transcribe(audio_filepaths[i], fp16=False)

Is there a difference between using the above code and using the code from the new Whisper API?


Also, I see that files larger than 25 MB need to be broken up in order to use the new API.

The API version of Whisper uses the Large model, and yes, you do have to break up files larger than 25 MB.
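
If it helps, here is a rough sketch of what that could look like: splitting a long recording into pieces under the limit with pydub and sending each piece to the API. This is just an illustration; the chunk length, file names, and the recent openai client style are assumptions, and pydub needs ffmpeg installed.

from openai import OpenAI
from pydub import AudioSegment

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def transcribe_long_file(path, chunk_minutes=10):
    """Split an audio file into fixed-length chunks and transcribe each via the API.

    chunk_minutes is an assumption: pick a length whose exported size stays
    under the 25 MB upload limit for your bitrate.
    """
    audio = AudioSegment.from_file(path)
    chunk_ms = chunk_minutes * 60 * 1000
    texts = []
    for i, start in enumerate(range(0, len(audio), chunk_ms)):
        chunk_path = f"chunk_{i}.mp3"
        audio[start:start + chunk_ms].export(chunk_path, format="mp3")
        with open(chunk_path, "rb") as f:
            result = client.audio.transcriptions.create(model="whisper-1", file=f)
        texts.append(result.text)
    return " ".join(texts)

Splitting on silence or sentence boundaries would give cleaner transcripts than fixed-length cuts, but this shows the basic flow.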