I have the same issue; we first noticed it on Friday, Oct. 13. I use gpt-3.5-turbo-16k-0613, and requests were taking more than 3 minutes. Today:
gpt-3.5-turbo-0613
result: {
object: 'chat.completion',
created: 1697364500,
model: 'gpt-3.5-turbo-0613',
choices: [ { index: 0, message: [Object], finish_reason: 'stop' } ],
usage: { prompt_tokens: 2436, completion_tokens: 1120, total_tokens: 3556 }
}
time: 26.156s
gpt-3.5-turbo-16k-0613
result: {
object: 'chat.completion',
created: 1697364546,
model: 'gpt-3.5-turbo-16k-0613',
choices: [ { index: 0, message: [Object], finish_reason: 'stop' } ],
usage: { prompt_tokens: 2436, completion_tokens: 787, total_tokens: 3223 }
}
time: 92.874s
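
To put the two runs on the same footing, here is a rough sketch that turns them into completion tokens per second. The helper name `completionTokensPerSecond` and the `runs` array are just for illustration (they are not part of any library); the numbers are copied from the output above. Note that the measured time is wall-clock for the whole request, so it also includes network latency and prompt processing, not only generation.

```javascript
// Hypothetical helper: completion tokens produced per second of wall-clock time.
function completionTokensPerSecond(usage, elapsedSeconds) {
  return usage.completion_tokens / elapsedSeconds;
}

// Figures copied from the two runs reported above.
const runs = [
  {
    model: 'gpt-3.5-turbo-0613',
    usage: { prompt_tokens: 2436, completion_tokens: 1120, total_tokens: 3556 },
    seconds: 26.156,
  },
  {
    model: 'gpt-3.5-turbo-16k-0613',
    usage: { prompt_tokens: 2436, completion_tokens: 787, total_tokens: 3223 },
    seconds: 92.874,
  },
];

// Print the throughput for each model, rounded to one decimal place.
for (const run of runs) {
  const rate = completionTokensPerSecond(run.usage, run.seconds);
  console.log(`${run.model}: ${rate.toFixed(1)} completion tokens/s`);
}
```

By this measure the 16k model came out at roughly 8.5 tokens/s against about 42.8 tokens/s for the 4k model, so around five times slower for the same prompt size.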