Hi Community,
I’ve been working on a project that uses the GPT-3.5-Turbo model. I demoed a version of the project in mid-July, with the project’s maximum response generation timeout set to 8 seconds, and the demo worked perfectly for all of my use cases. Running the exact same code today, however, those use cases are failing. In particular, chat completion requests that result in a function call have significantly higher latency.
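For reference, this is roughly how I’m timing the calls. It’s a minimal sketch rather than my actual code: the function schema, model snapshot, and API key are placeholders, and I’m assuming the pre-1.0 Python SDK (0.27.x) here.

```python
import time
import openai  # pre-1.0 openai Python SDK (0.27.x)

openai.api_key = "sk-..."  # placeholder

# Hypothetical function schema, just for illustration
functions = [
    {
        "name": "get_weather",
        "description": "Get the current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {
                "city": {"type": "string", "description": "City name"},
            },
            "required": ["city"],
        },
    }
]

# Time a single chat completion request that is likely to trigger a function call
start = time.perf_counter()
response = openai.ChatCompletion.create(
    model="gpt-3.5-turbo-0613",  # placeholder snapshot
    messages=[{"role": "user", "content": "What's the weather in Paris?"}],
    functions=functions,
    function_call="auto",
    request_timeout=8,  # the 8-second budget mentioned above
)
elapsed = time.perf_counter() - start
print(f"latency: {elapsed:.2f}s, finish_reason: {response.choices[0].finish_reason}")
```

When the model decides to call the function, `finish_reason` comes back as `function_call`, and those are the requests that now routinely blow past the 8-second budget.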
I’m using the exact same environment as last month, so I’m confident nothing has changed on my end. That makes me wonder whether the API has gotten slower recently. Has anyone else experienced the same thing?
Thanks!