GPT-3.5 API is 30x slower than ChatGPT equivalent prompt

I have the same issue; we first noticed it on Friday, Oct. 13. I use gpt-3.5-turbo-16k-0613, and requests were taking more than 3 minutes. Here are today's timings for the same prompt on both models:

gpt-3.5-turbo-0613

result:  {
  object: 'chat.completion',
  created: 1697364500,
  model: 'gpt-3.5-turbo-0613',
  choices: [ { index: 0, message: [Object], finish_reason: 'stop' } ],
  usage: { prompt_tokens: 2436, completion_tokens: 1120, total_tokens: 3556 }
}
time:  26.156s

gpt-3.5-turbo-16k-0613

result:  {
  object: 'chat.completion',
  created: 1697364546,
  model: 'gpt-3.5-turbo-16k-0613',
  choices: [ { index: 0, message: [Object], finish_reason: 'stop' } ],
  usage: { prompt_tokens: 2436, completion_tokens: 787, total_tokens: 3223 }
}
time:  92.874s
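
To put the two runs on the same footing, dividing completion_tokens by the elapsed time gives a rough throughput for each model. This is a minimal sketch that uses only the numbers logged above; the `runs` array and the `tokensPerSecond` helper are illustrative names, not part of any API. The elapsed time also covers network latency and prompt processing, so the figures are approximate.

```javascript
// Figures copied from the two logged responses above.
const runs = [
  { model: "gpt-3.5-turbo-0613", completionTokens: 1120, seconds: 26.156 },
  { model: "gpt-3.5-turbo-16k-0613", completionTokens: 787, seconds: 92.874 },
];

// Completion tokens generated per second of wall-clock time.
function tokensPerSecond(run) {
  return run.completionTokens / run.seconds;
}

for (const run of runs) {
  console.log(`${run.model}: ${tokensPerSecond(run).toFixed(1)} tokens/s`);
}
```

By this measure the 16k model came out at roughly a fifth of the throughput of the 4k model on the same prompt (about 8.5 vs. 43 tokens/s).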