ChatGPT API responses are very slow

nandha · March 14, 2023, 3:56pm

ChatGPT API responses are very slow, even for short API calls with 200-400 tokens take 20-30 seconds. Is there any way to make the response faster.

ruby_coder · March 15, 2023, 1:44am

Hi @nandha

Yes, things are slow based on the demand. I just checked for you by sending 300 words of lorem epsom text to the chat completion API for you and got these results:

`gpt-3.5-turbo-0301`

Total Tokens: 826, Completion API Time: 16.17 seconds
Total Tokens: 866, Completion API Time: 14.434 seconds
Total Tokens: 1313,  Completion API Time: 38.629 seconds

I don’t think there is much you can do at the moment as the issue is with the performance of the turbo model(s.) You could switch to another model, which have tested to be faster than turbo these days.

HTH

Appendix: Example Completion

permelianunesfx · March 20, 2023, 4:15am

hi ruby_coder,
im using api and model gpt 3.5 turbo too. but the response is very slow. im calling api by python.

ruby_coder · March 20, 2023, 4:54am

Yeah, is it slow, for sure now, I tested again for you now, completion time was nearly 22 seconds:

My advice is to relax and do something less frustrating until the issue on the OpenAI infrastructure side improves, if you can.

HTH

sefimero · March 20, 2023, 9:27am

Yes, may be “turbo” it´s a little bit “pretencious” adjective for this model
I´m using curl with PHP on 500 tk max environment and the answers takes arround 30-50 secs to get ready.

swoh · March 20, 2023, 12:07pm

Mine, too… I’m also having connection errors like this.

openai.error.APIConnectionError: Error communicating with OpenAI: (‘Connection aborted.’, ConnectionResetError(104, ‘Connection reset by peer’))

apeinopu · March 20, 2023, 4:51pm

Same slowness here, plus occasional 502 Bad Gateway responses after a long wait.

sbusso · March 20, 2023, 5:55pm

Sadly, the API is throttled for normal paying users. And at the moment we are getting also a lot of errors. Not very usable in the current state and we hope OpenAI will find a solution soon.

jazzg · March 22, 2023, 6:17pm

Is there a way to avoid this error?
I got a loop that broke today after 5 minutes and I didn’t even notice when it did.

TensorHalo · March 23, 2023, 12:37am

The best way to adjust, I think is trying to change your solution to avoiding invoke the API or classify your demands to reduce the times calling it to decrease the total amount of time

james.gobert · March 23, 2023, 12:47am

I came here looking to see if other people were encountering this. I guess it is reassuring that it’s not just me. But also unfortunate because I’m hoping to launch my app in a few weeks and hope this improves.

Was gonna try using an another model but for this feature I need chat API to keep context. Guess I’ll just have to wait it out like everyone else.

jazzg · March 23, 2023, 3:42am

I’m using the @backoff.on_exception(backoff.expo, openai.error.RateLimitError) from backoff library. Trying

for i in rlist: 
    try: 
        #mycode
    except TimeoutError:
        print("error")
        continue

but it still breaks…

manudroid · April 11, 2023, 1:24pm

How do you get those reports of your queries? Is there an OpenAI webpage for that? I dont see that fine grained results in OpenAI API

AriaLiu · April 12, 2023, 7:41am

mark. Yes，Now he’s incredibly slow, and he’s getting slower and slower.I hope the official can improve it as soon as possible

jbackx · April 12, 2023, 9:39am

Performance of the OpenAI API is horrible for the moment. Are there plans to improve this soon because this instability in performance is blocking the roll out of our project.

seltz · April 12, 2023, 4:46pm

API responses have been consistently 20-50 seconds for about a week now- unusable when ChatGPT itself seems faster than it has ever been

mgbarri · April 12, 2023, 5:14pm

Is there a way to get someone from OpenAI to comment on this? Why are paying customers being rate limited into unusable latencies? The model is supposed to be “turbo” 30-40 seconds is not very “turbo” for some 100s of tokens. The API is wayyyyyy slower than the free chat? Why? I doubt it’s a technical issue, is that a strategic decision to limit developers? If so, I think OpenAI should be more “open” with the community

BrianLovesAI · April 13, 2023, 2:56am

I think there are too many people using OpenAI API services. Like, I am a bit shocked that people are now saying ‘gpt-3.5-turbo’ is slow, because I remember ‘gpt-3.5-turbo’ had a good speed, with +1000 tokens. So… I feel like the server is packed now.

But my issue is more serious, because my company is using gpt-4, and gpt-4 is way slower though it is accurate. We are about to launch this internally, and I can imagine that our customer service team might say the chat bot is too slow and complain.

matt.jiang · April 14, 2023, 5:37pm

totally suffered from the same problem, awesome response, but awful slow

erik.pragt · April 25, 2023, 5:44am

I just signed up for an OpenAI subscription myself, and I expected a similar response time as what ChatGPT is using, but using the gpt-3.5-turbo model, the response times are 30-60 seconds, or timeout completely.

I’m only using it for a demo application, but it’s almost unusable due to this performance, and it’s extra disappointing I had to pay for this to experience this.

Topic		Replies	Views
Chat GPT's API is significantly slower than the website with GPT Plus API	35	36590	December 12, 2023
Very slow response time with chatgpt-3.5 turbo model API API	17	10977	December 19, 2023
GPT-3.5 API is very slow. Any fix? API	31	9866	October 12, 2023
Slow Chat api responses ------ API	17	6406	December 24, 2023
Chat API is slow!, Fix it! API gpt-35-turbo , chatgpt , api	6	2600	December 24, 2023

ChatGPT API responses are very slow

gpt-3.5-turbo-0301

Appendix: Example Completion

Related topics

`gpt-3.5-turbo-0301`