Well, we learned a lot from you code, @derek8bai
You are talking about the chat completion
endpoint and not the completion
endpoint and you are using the gpt-3.5-turbo
model,
It is well know these new chat completion
models are very stressed with new users pounding on them and so they are very slow.
HTH