GPT-3.5 API is 30x slower than ChatGPT equivalent prompt

ameramayreh · October 15, 2023, 10:27am

I have the same issue, we noticed this from Friday Oct. 13. I use gpt-3.5-turbo-16k-0613, It was taking more than 3 minutes. Today:

gpt-3.5-turbo-0613

result:  {
  object: 'chat.completion',
  created: 1697364500,
  model: 'gpt-3.5-turbo-0613',
  choices: [ { index: 0, message: [Object], finish_reason: 'stop' } ],
  usage: { prompt_tokens: 2436, completion_tokens: 1120, total_tokens: 3556 }
}
time:  26.156s

gpt-3.5-turbo-16k-0613

result:  {
  object: 'chat.completion',
  created: 1697364546,
  model: 'gpt-3.5-turbo-16k-0613',
  choices: [ { index: 0, message: [Object], finish_reason: 'stop' } ],
  usage: { prompt_tokens: 2436, completion_tokens: 787, total_tokens: 3223 }
}
time:  92.874s

Topic		Replies	Views
GPT-3.5 Turbo API response is slow API	20	12505	November 11, 2023
GPT-3.5 API is very slow. Any fix? API	31	9940	October 12, 2023
Chat Completion API super slow and hanging API	8	2301	December 13, 2023
Gpt-4-0125-preview INCREDIBLY slower than 3.5 turbo API	12	9590	July 22, 2024
We proved the API is intentionally slow API	56	18045	May 2, 2023

GPT-3.5 API is 30x slower than ChatGPT equivalent prompt

Related topics