GPT-3.5 API is 30x slower than the equivalent prompt in ChatGPT

Did anyone apply for Azure OpenAI recently? Can you share your experiences?

I’ve looked at their application form. They say Azure OpenAI is for enterprise customers only, and they require a business phone number.

I have a business license in my country, but I’m just one guy working from home. Do they call the provided phone number to check on you?

If any of you are a business with an office, you should apply for Azure OpenAI ASAP. The regular OpenAI API is not a service for business; it feels more like a field of experimentation.

Is it GPT Vision that is consuming platform capacity? Even in the Chat app, I’m getting less than 4 tokens/s.

For me, the slowdown is also occurring, but with all models, both in the playground and via normal API access. Last time I used it (about two weeks ago) everything was at normal speed. Now a response of about 250 total tokens takes 6-9 seconds, versus 0-3 seconds before.

I am also having the slowdown issue. With gpt-3.5-turbo, responses are taking 3-4 minutes (~3,500 total tokens). My larger requests against gpt-3.5-turbo-16k-0613 just time out. This issue is new since the end of last week.

I have tried this both running locally and from AWS EC2, with the same results.
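For anyone who wants to quantify the slowdown rather than eyeball it, here is a minimal sketch of timing a request and computing tokens per second. The helper names are my own, and the actual API call is stubbed out with a sleep; you would replace `fake_call` with a real request (e.g., at the time of this thread, `openai.ChatCompletion.create(...)` and reading `usage["total_tokens"]` from the response).

```python
import time


def tokens_per_second(total_tokens: int, elapsed_s: float) -> float:
    """Throughput for a single completion call."""
    return total_tokens / elapsed_s if elapsed_s > 0 else 0.0


def measure(call):
    """Time a zero-argument callable that returns a token count."""
    start = time.perf_counter()
    total_tokens = call()
    elapsed = time.perf_counter() - start
    return elapsed, tokens_per_second(total_tokens, elapsed)


def fake_call():
    # Stand-in for the real request: replace with an API call that
    # returns the response's usage["total_tokens"].
    time.sleep(0.1)  # simulated network + generation time
    return 250       # simulated total token count


elapsed, tps = measure(fake_call)
print(f"{elapsed:.2f}s, {tps:.0f} tokens/s")
```

Running the same prompt a few times and comparing the tokens/s figure before and after the slowdown makes the reports above directly comparable.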


Same here, from Israel: slow responses for context-related questions (3-5 seconds), and very slow responses for off-topic questions (sometimes more than 30 seconds).

It used to be 1-2 seconds, 3-5 at worst, until 3 or 4 weeks ago.

It looks like the symptom is spreading to more people. Could you report it at help.openai.com? Use the chat button in the lower-right corner, and maybe link this post along with it.

They give a response, even if it’s only a standard answer.

I already did it. Maybe more people reporting the same issue would help escalate it.

I found a pattern. My account, which was created on July 1st, has normal speed as of now. Accounts created after that day are all slow.

It looks clear to me now: OpenAI is directing recently created accounts to crowded nodes. That’s why the old-timers on this forum seem oblivious to this issue.


Hopefully, this fix helps. :smiley:

Update:
[screenshot attached, 2023-10-16 12:12 PM]

Sorry, no good :sweat_smile:

I present one more side-by-side comparison, recorded a minute ago: https://www.youtube.com/watch?v=JD5rVRIuzWs

Left: account created on July 1st.
Right: account created after that date.
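If anyone wants to reproduce this kind of side-by-side comparison without recording video, here is a rough sketch that takes the median of several timed runs per account. The helper name is mine, and the two calls are stubs; in practice each would send an identical prompt using that account’s API key.

```python
import statistics
import time


def timed_runs(call, n=5):
    """Run `call` n times and return the median wall-clock latency in seconds."""
    times = []
    for _ in range(n):
        start = time.perf_counter()
        call()
        times.append(time.perf_counter() - start)
    return statistics.median(times)


# Stubs standing in for identical prompts sent with each account's API key.
def old_account_call():
    time.sleep(0.01)  # simulated fast account


def new_account_call():
    time.sleep(0.03)  # simulated slow account


print("old account:", timed_runs(old_account_call))
print("new account:", timed_runs(new_account_call))
```

Using the median rather than a single run smooths out the occasional network blip, which matters when the claimed difference is between accounts rather than between requests.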

This is different, then, and a completely different problem: it has to do with the date the account was created. That is so odd. Please supply the model version you were using in the playground. Because this differs from the current topic, please create a new topic and describe your issue there.

The issue has not been resolved.

In fact it’s slightly worse now: my original test query has gone from an average of 34 seconds to 37 seconds.

I would file a new report with OpenAI. They probably closed the old one because they believe it is fixed.


I’m also experiencing significantly longer API wait times for the past 8-9 days.

A request that would normally take 15-30 seconds is now taking upwards of 75!

I hope OpenAI can communicate better about this rather than leaving us in the dark, especially because these longer wait times are a significant UX barrier in already-launched apps.


I am having the same issues. We are seeing around 38-45 seconds, or outright timeouts, on requests that used to take 10-15 seconds at most. It is making our application unusable.


It looks like the speed depends on how much you spend.
As we are in the implementation and testing stage, spending no more than $10 a month, it is very slow.
A friend of mine, who spends around $500 a month on a production load, says it is super fast.

It looks like OpenAI has different infrastructure for different accounts based on their stage (pre-prod or prod), determined by spending. Can anybody confirm?

Thanks.


The observed difference in response times could be attributed to factors such as network latency, server load, or other technical considerations. To investigate, it would be advisable to contact OpenAI’s support team directly with the specific account and usage details. They will be better equipped to provide insights into the slow response times experienced with the GPT-3.5 Turbo API.

@contact:gptnederlands.nl

It’s not like some of us haven’t reported this to OpenAI.

If your account is slow, check when you registered your account. My theory is that accounts registered on or before July 1st are okay.

Another check is to see whether you still have a record of your free-trial credit on your usage screen.

Anyway, since I have an account that isn’t affected, I’ll use it and forget about this. I hope you guys get a solution soon.

If my guess is right, maybe accounts registered in July will be released from the “rate limit” on the first day of next month.

So everyone is suddenly noticing a 3-5x increase at random, and it’s down to our individual accounts with everything else at parity? I feel like there is more to this.

This is certainly not being experienced by everyone. Below is a plot of the time taken to generate 256 tokens by GPT-3.5 and GPT-4 over the past week; there are some blips from infrastructure issues and the usual hiccups of internet-connected APIs, but in general the performance looks unchanged.
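A recurring benchmark like the one plotted above can be sketched roughly as follows. The request itself is stubbed with a sleep; the model label and the `latency_log.csv` filename are illustrative choices of mine, and in a real run the call would be a fixed-length generation (e.g. `max_tokens=256`) so that timings are comparable across days.

```python
import csv
import time


def benchmark(call, model: str, writer):
    """Time one fixed-length generation and log: timestamp, model, seconds."""
    start = time.perf_counter()
    call()  # e.g. a chat completion requesting a fixed max_tokens
    elapsed = time.perf_counter() - start
    writer.writerow([time.time(), model, f"{elapsed:.3f}"])


with open("latency_log.csv", "a", newline="") as f:
    writer = csv.writer(f)
    # Stub in place of a real 256-token generation request.
    benchmark(lambda: time.sleep(0.02), "gpt-3.5-turbo", writer)
```

Run on a schedule (cron, a loop, etc.), this produces exactly the kind of time series that lets you distinguish a real regression from a one-off blip.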

My account was created well before July 1st and is experiencing this slowness. I also created a new organization and am seeing the same issue with the new org and keys. I’m based out of Toronto, with great internet speed.

Of those experiencing the slowness, what is your monthly usage limit? Mine is $120, and I’ve requested an increase, as I will soon be onboarding many new clients onto a service that depends on this API. Not sure if there’s a correlation with the spend limit, or possibly with location.