Overloaded, but still paying

mikkelhojlund · May 13, 2023, 7:02pm

Hi, Has the gpt-3.5-turbo been overloaded all day? I see that my apis are failling, but OpenAI is still charging full price for the call even though they only deliver an error?

jwatte · May 13, 2023, 10:22pm

I’ve found that there’s some kind of caching involved.
Sometimes, when I get an error, and I re-try the request within a few seconds, I get a very fast answer, so it seem as if they cache the exact request.
It might be that the “overload error” is really a “gateway timeout error,” and the model actually keeps inferring on the back end, even after the gateway has expired.
If that’s the case, then it looks to their system as if your request did indeed generate a bunch of work for them.

A single inference is so low cost that I don’t worry about the cost here. If it were to happen to, say, 50% of requests, then that might be a different question…

Also, yes, it feels as if the API has been a lot slower in the last week.

Topic		Replies	Views
502 on GPT-4 for the past 18 hours and 10% on GPT-3.5-Turbo API gpt-4 , api	1	783	June 2, 2023
The error message of "That model is currently overloaded with other requests. " using gpt-3.5-turbo API	10	6861	December 18, 2023
That model is currently overloaded API	4	2933	December 18, 2023
[GPT-3.5-Turbo] ‘The server is overloaded or not ready yet’ errors API chatgpt , api	11	8659	February 4, 2024
🐢 : GPT4 extremely slow on GPT4 API and ChatGPT API gpt-4 , chatgpt	18	3549	April 9, 2024

Overloaded, but still paying

Related topics