Is API down for everyone or just me?

markwwd · March 20, 2023, 12:50pm

So…

It’s been 6 hours here and my API requests are still timing out.

What’s going on? Is anyone else experiencing this? Status page says that everything’s okay.

ruby_coder · March 20, 2023, 12:51pm

Why not search the site and read the other topics on this before posting?

Or at least look at the topics before posting? Why not?

Thanks

markwwd · March 20, 2023, 12:54pm

I did, but I’m flabbergasted that more people aren’t posting about this…

ruby_coder · March 20, 2023, 1:05pm

Amazing.

I have seen post after post on this topic all day, so many that they are simply spam and noise.

Have a great day.

Ahtesham · March 20, 2023, 1:06pm

Facing same issue.

So the API is down. Right?

AgusPG · March 20, 2023, 1:17pm

No. No right. The API is working for me with a proper retry/fallback strategy.
This is truly fascinating.

Ahtesham · March 20, 2023, 1:34pm

Are you using that for fine-tuning?
I am trying to fine-tune the model, but it seems like it’s getting disconnected every time.

logankilpatrick · March 20, 2023, 1:48pm

Please see the status of the API here: https://status.openai.com/

markwwd · March 20, 2023, 1:54pm

The status page says there are no issues currently, yet everyone using the 3.5 or 4 API is getting a “timeout” when sending a request.

AgusPG · March 20, 2023, 2:07pm

Nope, no fine-tuning now. I’m talking about requests to https://api.openai.com/v1/chat/completions.

crowdreactor · March 20, 2023, 3:01pm

API timing out for me too, even with a retry policy with 5 tries.

AgusPG · March 20, 2023, 3:02pm

Can you share further details? What is the timeout per retry? Do you fall back to other models? Do you have backoff?

crowdreactor · March 20, 2023, 3:06pm

It is 5 retries with exponential backoff (1, 2, 4, 8, 16). No, I do not fall back to other models, but that’s a good idea. However, I’m not sure whether DaVinci is also timing out, and it might require different prompts than 3.5 Turbo.

Ideally 3.5 just works consistently.
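
For readers who want to see what a policy like this looks like in code, here is a minimal sketch, not the poster's actual implementation. It assumes "5 retries" means one initial attempt plus up to five retries, with waits of 1, 2, 4, 8 and 16 seconds before each retry. The `send` callable is hypothetical: it stands for whatever function performs one API request and raises `TimeoutError` when the request times out.

```python
import time


def backoff_delays(retries=5, base=1.0, factor=2.0):
    # Waits before each retry: 1, 2, 4, 8, 16 seconds with the defaults.
    return [base * factor**i for i in range(retries)]


def retry_with_backoff(send, retries=5, sleep=time.sleep):
    # `send` is a hypothetical zero-argument callable that performs one
    # request and raises TimeoutError when that request times out.
    delays = backoff_delays(retries)
    for attempt in range(retries + 1):
        try:
            return send()
        except TimeoutError:
            if attempt == retries:
                raise  # out of retries: surface the last timeout
            sleep(delays[attempt])
```

Passing `sleep` as a parameter is only there so the schedule can be checked without actually waiting.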

AgusPG · March 20, 2023, 3:07pm

And what is the custom timeout per API call?

crowdreactor · March 20, 2023, 3:09pm

The timeout is currently at 20 seconds.

AgusPG · March 20, 2023, 3:16pm

@crowdreactor thanks a lot for sharing the details about the implementation. This is the only way we developers can help others debug their problems. So, for your case, I’d say:

5 retries is probably too much, especially if you’re using the same model all the time.
20s timeout is probably too short, especially if you’re asking for huge completions and you’re not streaming.

If your base model is gpt-3.5-turbo, I’d say to experiment with something like:

1 call to turbo with timeout = 30s.
Wait for 4s.
1 call to turbo with timeout = 30s.
Wait for 8s.
1 call to davinci-003 with timeout = 30s.

And yeah, the output would obviously depend on the model. You can try to optimize your prompt for your model. Even if you don’t, it’s usually better to return something rather than nothing. Anyways, the actual implementation totally depends on your use case. You might want to set up even longer initial timeouts (1min or more), especially if your customers do not need online interaction with your app.
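
The schedule suggested above could be sketched roughly as follows. This is an illustration under stated assumptions rather than a definitive implementation: `send(model, prompt, timeout)` is a hypothetical callable that makes one request and raises `TimeoutError` if no answer arrives in time, and "davinci-003" is written with its full model name, `text-davinci-003`. Because the fallback model may sit behind a different endpoint and want a different prompt format, that detail is left to `send`.

```python
import time

# Each step: (model, per-call timeout in seconds, wait in seconds before the next step)
PLAN = [
    ("gpt-3.5-turbo", 30, 4),
    ("gpt-3.5-turbo", 30, 8),
    ("text-davinci-003", 30, 0),
]


def call_with_fallback(send, prompt, plan=PLAN, sleep=time.sleep):
    # `send` is hypothetical: it performs one request against `model` and
    # raises TimeoutError if nothing comes back within `timeout` seconds.
    if not plan:
        raise ValueError("plan must contain at least one step")
    last_error = None
    for step, (model, timeout, wait) in enumerate(plan):
        try:
            return send(model, prompt, timeout)
        except TimeoutError as err:
            last_error = err
            if step < len(plan) - 1:
                sleep(wait)  # pause before moving on to the next step
    raise last_error
```

Keeping the schedule in a plain list makes it easy to lengthen the timeouts or swap the fallback model without touching the loop.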

crowdreactor · March 20, 2023, 3:38pm

Alright thanks I will try that out. Unfortunately this is a web app and customers need to interact with it.

For what it’s worth, I’m using the same prompts I’ve always used, and they were lightning fast until yesterday. These issues only started yesterday, and it doesn’t look like I’m the only one.

AgusPG · March 20, 2023, 3:45pm

You’re not. But this is not the first time that this has happened. And it won’t be the last one. Scaling up technology and predicting demand is not as easy as some people seem to believe. So our apps need to be ready to deal with partial or global outages. Because they will eventually happen.

wisnumpak · March 20, 2023, 4:22pm

Same here, I have not been able to access chat.openai.com all day.

anon10827405 · March 20, 2023, 4:42pm

The actual chat has gone down for me as well.

Reporting 429. Hopefully they get their scaling figured out.
Must be an incredible task considering how fast everything is growing.
