Sometimes 1106 preview is slow

Hey guys! I’m trying to use Azure OpenAI’s latest model (1106-Preview) with openai Python library version 1.3.5, and I’ve found that the API is sometimes very slow (30-40s). Do I need to set some parameter in the chat method, or is this an issue with the model itself? Thanks. The old version (0613) worked fine and didn’t have this issue.
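Not a fix, but one way to narrow it down is to time each request and see whether the latency is consistent or only occasional. A minimal sketch, with a stub (`slow_model_call`, a hypothetical name) standing in for the real `client.chat.completions.create(...)` call so it runs without an API key:

```python
import time

def timed_call(fn, *args, **kwargs):
    """Run fn and return (result, elapsed_seconds)."""
    start = time.monotonic()
    result = fn(*args, **kwargs)
    return result, time.monotonic() - start

def slow_model_call(prompt):
    # Stub standing in for client.chat.completions.create(...);
    # the sleep simulates model latency.
    time.sleep(0.05)
    return f"echo: {prompt}"

result, elapsed = timed_call(slow_model_call, "hello")
print(result, round(elapsed, 2))
```

Logging `elapsed` per request over a day or so should show whether the 30-40s responses are random spikes or tied to specific prompts.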

The model is not production ready. You might consider whether OpenAI is deliberately making some interactions randomly very slow, or failing them, just to discourage its use as a production replacement for now.

The model also often takes time before it begins returning tokens. With the new finish_reason of “content_filter” now available, I suspect the choice of whether to send input to the moderations endpoint has been taken away from you: you now get “I’m sorry, I can’t do that” refusals that are unlikely to come from the AI model itself, unless it was specifically trained to be an a-hole. That could be another source of dependency and delay.
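For reference, a small sketch of how you might detect such filtered responses in client code. The `Choice` dataclass below is a hand-rolled stand-in for the choice objects the openai Python 1.x client returns (which expose `finish_reason` the same way), not the library’s actual class:

```python
from dataclasses import dataclass

# Stand-in for a choice object from chat.completions.create;
# the real client exposes choice.finish_reason similarly.
@dataclass
class Choice:
    finish_reason: str  # e.g. "stop", "length", or "content_filter"
    content: str

def was_content_filtered(choice: Choice) -> bool:
    """True when the response was cut off by the content filter."""
    return choice.finish_reason == "content_filter"

print(was_content_filtered(Choice("content_filter", "")))  # True
print(was_content_filtered(Choice("stop", "Hello!")))      # False
```

Checking `finish_reason` this way at least lets you separate filter-induced refusals from ordinary slow completions.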


gpt-3.5-turbo-1106 is also sometimes very slow, but gpt-3.5-turbo and gpt-3.5-turbo-16k-0613 are normal. Has anyone else encountered this?

Got it. Thanks for your reply!