Variable Response Times on the gpt-3.5-turbo API

I’m wondering if anyone else is seeing variable response times (e.g., one response takes 5 s while another takes 5 minutes). I am seeing this behavior:

API call 1 takes place at 9:30 AM
API call 2 takes place at 9:32 AM
API call 2 receives a response at 9:32 AM
API call 1 receives a response at 9:35 AM

This is using the gpt-3.5-turbo API.
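
For anyone wanting to measure this, something along these lines should capture per-call latency. This is only a minimal sketch, assuming the pre-1.0 `openai` Python package (with the API key read from the `OPENAI_API_KEY` environment variable); the prompt and the 2-minute gap are placeholders mirroring the timeline above.

```python
import time
import threading
import openai

def timed_call(call_id: str) -> None:
    # Record how long a single chat completion request takes end to end.
    start = time.monotonic()
    openai.ChatCompletion.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": "Hello"}],  # placeholder prompt
    )
    elapsed = time.monotonic() - start
    print(f"{call_id} finished after {elapsed:.1f}s")

# Fire two calls roughly two minutes apart, as in the timeline above.
t1 = threading.Thread(target=timed_call, args=("API call 1",))
t1.start()
time.sleep(120)  # second call starts ~2 minutes later
t2 = threading.Thread(target=timed_call, args=("API call 2",))
t2.start()
t1.join()
t2.join()
```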

Do the two API calls differ in their request parameters? We need more details to debug this; please share your whole code, if possible.

Hi @rasengan,

I think that because others have been experiencing a lot of timeout errors in the last few days, this might also be related to some changes and scale-ups in OpenAI’s infrastructure. But this is only a guess; as @AgusPG mentioned, more details on your requests would help us provide a more specific answer :slight_smile:
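
In the meantime, one common mitigation while latency is spiky is to set a client-side timeout and retry. A minimal sketch, again assuming the pre-1.0 `openai` package; the `request_timeout` value and backoff here are illustrative, not recommendations:

```python
import time
import openai

def chat_with_retry(messages, retries=3, timeout_s=30):
    for attempt in range(retries):
        try:
            return openai.ChatCompletion.create(
                model="gpt-3.5-turbo",
                messages=messages,
                request_timeout=timeout_s,  # fail fast instead of waiting minutes
            )
        except openai.error.Timeout:
            # Back off briefly before retrying the timed-out request.
            time.sleep(2 ** attempt)
    raise RuntimeError("All retries timed out")
```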
