Api response time too long

helviodarochalima · November 2, 2023, 12:42pm

The opening response time for access to the GPT4 model is very high. usually the prompt for a 100 word context and a 50 word prompt is taking more than 1 minute to respond. I use a bubble api to call from a nocode application. What can be done to optimize this call and not leave the user waiting so long?

cyruszei · November 2, 2023, 12:44pm

I also noticed this and I wonder why ? I understand the demands, but since this is a py-as-you-go API then there should be some limit to the response time so that it won’t take that long.

I have no idea why, but I think it has to do with higher demands

_j · November 2, 2023, 1:05pm

The API response should be streamed. The API will then give a word-by-word production like you see in ChatGPT.

If you don’t get the first token back from the model within a few seconds, you should close the connection and retry.

Are you literally waiting a minute before you see a streamed response start to generate? You can replicate the behavior in the API playground? Maybe this is the “latency” they promise to punish people with in the new tier system.

Topic		Replies	Views
API response time is insane (60+ seconds) API	3	1631	December 4, 2023
OpenAI API takes too long to response API api	2	771	March 25, 2024
GPT-4 API slow response over 60sec API	6	2421	February 16, 2024
Is there an issue with GPT 3.5 turbo 16k? API	5	931	October 27, 2023
Very slow response time with chatgpt-3.5 turbo model API API	17	10907	December 19, 2023

Api response time too long

Related topics