Too long response time on API gpt-3.5-turbo model

webmaster.swapnil · March 27, 2023, 9:45am

When I tried multiple Arabic queries on the gpt-3.5-turbo model, it took approx 1 minute to 3 minutes time to respond query. But Davinci took approx 5 seconds to 10 seconds. Is there any suggestion to optimize response time?

taivo · March 27, 2023, 9:48am

Without access to what’s happening behind OpenAI’s API, a thought:

Did you try the exact same input on both? Input and output token count affects the response time, so that may be a cause of differences.

You can count tokens without calling the API: see this help page.

webmaster.swapnil · March 27, 2023, 3:18pm

Yes I have try exact same input on both. If I have use max_token value as 2000 then Davinci model also take approx 1 Minutes to 1.5 Minutes. Still it’s very high response time.

Is there any way to optimize the response time.
Can response time will be improve on paid module.

More information: I have tried https://api.openai.com/v1/chat/completions and https://api.openai.com/v1/completions

Topic		Replies	Views
Chatgpt-3.5 turbo model takes long time to respond. Is there any way to speed this up? API gpt-35-turbo , api-speed	7	6441	December 19, 2023
How can I improve response times from the OpenAI API while generating responses based on our knowledge base? API chatgpt , api	3	18666	November 9, 2023
Gpt-3-turbo slow vs chatgpt 3 website API	5	1638	December 16, 2023
Very slow response time with chatgpt-3.5 turbo model API API	17	10834	December 19, 2023
OpenAI API takes too long to response API api	2	625	March 25, 2024

Too long response time on API gpt-3.5-turbo model

Related topics