Whisper API costs 10x more than hosting a VM?

Whatever the model, the API pricing doesn't correspond to the underlying compute cost.

Hi @anon34024923! I’m curious about the “multiplied by 20” part. Do you mean that you can run 20 parallel inference jobs on a single 4090 for $0.74/h?
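
For reference, here's a quick back-of-envelope version of that math. The 20 parallel streams are exactly the assumption being questioned; the $0.006/min figure is OpenAI's published Whisper API rate:

```python
# Back-of-envelope check of the "multiplied by 20" claim.
# Assumptions: OpenAI Whisper API at $0.006 per audio minute, a rented
# 4090 at $0.74/h, and (hypothetically) 20 parallel real-time streams.

API_PRICE_PER_MIN = 0.006            # $ per minute of audio (OpenAI pricing)
GPU_PRICE_PER_HOUR = 0.74            # $ per hour for the rented 4090
PARALLEL_STREAMS = 20                # assumed concurrent real-time jobs

api_cost_per_audio_hour = API_PRICE_PER_MIN * 60                 # $0.36
gpu_cost_per_audio_hour = GPU_PRICE_PER_HOUR / PARALLEL_STREAMS  # ~$0.037

print(f"API: ${api_cost_per_audio_hour:.3f} per audio hour")
print(f"GPU: ${gpu_cost_per_audio_hour:.3f} per audio hour")
print(f"Ratio: {api_cost_per_audio_hour / gpu_cost_per_audio_hour:.1f}x")
# ~10x in favor of the GPU -- but only if you actually sustain
# 20 parallel streams at near-100% utilization.
```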


Whisper is so light on resources, I just run it locally.

I run a 16-core Threadripper with a cheap RTX 3060, and I can generate amazingly accurate subtitles for an entire movie in a couple of minutes.
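
For anyone wanting to try this, here's a minimal sketch using the open-source openai-whisper package. The file name and model size are placeholders, and the SRT writing is done by hand to keep the example self-contained:

```python
# Minimal local-Whisper subtitle sketch (pip install openai-whisper).
# "movie.mkv" is a placeholder; pick a model size that fits your VRAM.
import whisper

def fmt(t: float) -> str:
    """Format seconds as an SRT timestamp: HH:MM:SS,mmm."""
    ms = int(t * 1000)
    h, ms = divmod(ms, 3_600_000)
    m, ms = divmod(ms, 60_000)
    s, ms = divmod(ms, 1_000)
    return f"{h:02}:{m:02}:{s:02},{ms:03}"

model = whisper.load_model("medium")   # "small" also fits a 3060's 12 GB
result = model.transcribe("movie.mkv")

# Write the timestamped segments out as a standard .srt file.
with open("movie.srt", "w", encoding="utf-8") as f:
    for i, seg in enumerate(result["segments"], start=1):
        f.write(f"{i}\n{fmt(seg['start'])} --> {fmt(seg['end'])}\n"
                f"{seg['text'].strip()}\n\n")
```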

Did you ever go and try it out?

I just signed up for RunPod and tried both. The ‘serverless’ option, which is the only one that is truly billed by the second, works the way you describe only in theory, I think. You can create requests that have a delayed (and unpredictable) start and then run. It all works fine, but you’re not really in control of these GPUs: you can’t schedule those parallel jobs to happen on demand. You don’t have much control at all. So I think it’s a perfect solution for tasks where elapsed time doesn’t matter much and parallelism even less. For anything else you’d have to get a ‘server’, and those are all billed by the hour, so you pay regardless of use.
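
To illustrate the submit-and-poll pattern I mean, here's a sketch assuming RunPod's standard serverless endpoints; the endpoint ID and input payload are placeholders for whatever your own worker expects:

```python
# Sketch of the RunPod serverless pattern described above: you submit a
# job, it queues for an unpredictable time, and you poll until it finishes.
import os
import time
import requests

API_KEY = os.environ["RUNPOD_API_KEY"]
ENDPOINT_ID = "your-endpoint-id"     # placeholder
BASE = f"https://api.runpod.ai/v2/{ENDPOINT_ID}"
HEADERS = {"Authorization": f"Bearer {API_KEY}"}

# Submit: returns immediately with a job id; the cold start happens
# on RunPod's clock, not yours.
job = requests.post(
    f"{BASE}/run", headers=HEADERS,
    json={"input": {"audio_url": "https://example.com/a.mp3"}},
).json()

# Poll: the gap between IN_QUEUE and COMPLETED is the part you can't control.
while True:
    status = requests.get(f"{BASE}/status/{job['id']}", headers=HEADERS).json()
    if status["status"] in ("COMPLETED", "FAILED"):
        break
    time.sleep(2)

print(status)
```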

So I would say the by-the-second serverless option is more of a ‘dev mode’ or ‘rarely used’ setup, and the math we looked at does NOT apply, IMO.
Curious what experiences others have running stuff on RunPod.
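
To put numbers on “you pay regardless of use”, here's a rough sketch; the throughput and utilization figures are assumptions, not measurements:

```python
# Why the math breaks for a by-the-hour pod: you pay for the whole hour
# whether or not jobs arrive. Utilization figures below are illustrative.
GPU_PRICE_PER_HOUR = 0.74
AUDIO_HOURS_PER_GPU_HOUR = 20       # assumed throughput at full load
API_PRICE_PER_AUDIO_HOUR = 0.36     # $0.006/min * 60 (OpenAI pricing)

for utilization in (1.0, 0.5, 0.1, 0.02):
    effective = GPU_PRICE_PER_HOUR / (AUDIO_HOURS_PER_GPU_HOUR * utilization)
    cheaper = "GPU" if effective < API_PRICE_PER_AUDIO_HOUR else "API"
    print(f"utilization {utilization:4.0%}: ${effective:.3f}/audio hour -> {cheaper} wins")
# Under these assumptions the crossover sits around 10% utilization:
# below that, the hourly pod is already pricier than the API.
```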
