gpt-3.5-turbo-1106 model hangs for 10 minutes every couple of requests


I am using gpt-3.5-turbo-1106 and making a lot of requests back to back. I noticed that every 10–15 requests the model hangs, sometimes taking more than 10 minutes to complete the ongoing request. This was not the case last week.


Yeah, it not only hangs, it also re-generates the messages when used over the Assistants API. Check my bug report and see if that affects you as well.

Yep, the gpt-3.5-turbo-1106 variant is quite unstable; we are actively getting timeouts. To work around it, you might want to reduce your timeout to 30 seconds, since the library's default timeout is 10 minutes.

Yes, exactly. But how can I change the timeout value for the API?

If you are using the Node library:


(can’t paste link, you need to modify the above url.)

Otherwise, you will have to check the timeout options of the library you are using.
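For reference, a minimal sketch of both ways to set the timeout, assuming the official `openai` npm package (v4+); the model choice and the `ask` helper are just illustrative:

```typescript
import OpenAI from "openai";

// Global default: applies to every request made through this client.
const openai = new OpenAI({
  apiKey: process.env.OPENAI_API_KEY,
  timeout: 30_000, // 30 s instead of the 10-minute default
  maxRetries: 1,
});

async function ask(): Promise<string | null> {
  // Per-request override: the second argument carries request options.
  const completion = await openai.chat.completions.create(
    {
      model: "gpt-3.5-turbo-1106",
      messages: [{ role: "user", content: "ping" }],
    },
    { timeout: 30_000 },
  );
  return completion.choices[0].message.content;
}
```

Both the constructor options and the per-request options accept `timeout` (in milliseconds) and `maxRetries`; the per-request value wins when both are set.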

Also bumped into this, and it took me a while to figure out it's just OpenAI doing this.
I have a benchmarking script that makes 100 requests to gpt-3.5-turbo-1106 and calculates how many fail. What is interesting is that I can only reproduce this with an API key on an OpenAI account that is also used by a few other clients (in our playground), and it seems to happen more often if I use the playground and make requests from my computer at the same time. It looks like some throttling/bug kicks in when you make requests from multiple places at once: easily 5% of the requests fail when I do this.

If I switch to a different API key (from my own personal OpenAI account), the benchmarking script consistently succeeds with a 100% success rate.
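A failure-rate harness along the lines described above can be sketched like this; the `benchmark` helper and the stand-in request function are my own names, not from any library:

```typescript
// Runs `request` a fixed number of times and reports the failure rate.
// In the real script, `request` would be a chat-completion call with a
// short timeout, so hangs surface as rejections.
async function benchmark(
  request: () => Promise<unknown>,
  runs: number,
): Promise<number> {
  let failures = 0;
  for (let i = 0; i < runs; i++) {
    try {
      await request();
    } catch {
      failures++;
    }
  }
  return failures / runs; // fraction of failed requests
}
```

With 100 runs, a result of 0.05 corresponds to the "easily 5% fail" observation above.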

BTW, from my observation it hangs for much longer than 10 minutes. I waited about an hour and it never returned any data; 10 minutes is just the default timeout in openai-node. :slight_smile:

That was my main reason to switch to the thread/Assistants API. There you can offload the requests to the assistant and reap the response message later. I was hoping it would reduce the time for our script, and it does, but with the 3.5 model there is a flaw where it generates extra messages and eats into the tokens.
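The offload-and-reap flow boils down to a poll loop, which can be sketched generically like this; `pollUntil` and `getStatus` are names I made up, with the status call standing in for something like retrieving a run's status from the Assistants API:

```typescript
// Generic offload-and-reap poll loop: keep asking until the work is done,
// then return the result, giving up after a bounded number of attempts.
async function pollUntil<T>(
  getStatus: () => Promise<{ done: boolean; result?: T }>,
  intervalMs: number,
  maxAttempts: number,
): Promise<T> {
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    const { done, result } = await getStatus();
    if (done) return result as T;
    // Not done yet: wait before asking again.
    await new Promise((resolve) => setTimeout(resolve, intervalMs));
  }
  throw new Error("polling gave up after " + maxAttempts + " attempts");
}
```

The advantage over a blocking completion call is that a hung run only costs you poll attempts, not a stuck request.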


I tried setting the timeout value with the Node API (both through the per-request options argument and the global OpenAI constructor argument), but I still run into cases where my request to OpenAI just hangs. I'm calling it very simply:

    chatCompletion = await openai.chat.completions.create(body, {
      timeout: MAX_API_REQUEST_TIMEOUT_MS,
      maxRetries: 1,
    });
I even tried instrumenting my own timeout wrapper around the API, but it still hangs. Is something busy-waiting in the OpenAI library? My custom timeout code (the last log line is never reached):

    logger.info({}, "right before");
    await Promise.race([
      (async () => {
        chatCompletion = await openai.chat.completions.create(body, {
          timeout: MAX_API_REQUEST_TIMEOUT_MS,
          maxRetries: 0,
        });
      })(),
      new Promise<void>((resolve) => {
        setTimeout(() => resolve(), MAX_API_REQUEST_TIMEOUT_MS);
      }),
    ]);
    logger.info({}, "right after");

This seems to happen every couple of requests.
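One caveat with a `Promise.race` wrapper like the one above: the race settles, but the losing request keeps running in the background, since nothing aborts it. A wrapper that at least rejects on timeout could look like this (the `withTimeout` name is mine); to actually cancel the request you would additionally need an `AbortController`, assuming your SDK version accepts an abort signal in its request options:

```typescript
// Sketch of a timeout wrapper that rejects instead of resolving silently.
// Note: this does NOT cancel the underlying request; for that you would
// also need an AbortController whose signal the HTTP call honours.
function withTimeout<T>(promise: Promise<T>, ms: number): Promise<T> {
  return new Promise<T>((resolve, reject) => {
    const timer = setTimeout(
      () => reject(new Error("timed out after " + ms + " ms")),
      ms,
    );
    promise.then(
      (value) => { clearTimeout(timer); resolve(value); },
      (err) => { clearTimeout(timer); reject(err); },
    );
  });
}
```

A rejection also makes the failure visible to callers, whereas the `resolve()` in the race above silently falls through with `chatCompletion` still unset.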

This has been picked up by OpenAI and is being looked at, hopefully with a resolution soon.


On further experimentation, I think I have a slightly different issue: my prompt deterministically causes the OpenAI Node library to hang indefinitely, which is why I theorized that something was busy-waiting in the post above. I can't share it publicly since it's proprietary software, unfortunately. Can someone from OpenAI help me debug this?

Fixed as of 11/28; I didn't even have to update the npm package.

Yeah, in the last few days the performance of the models seems better. Noticed the same for GPT-4. Guess they were ramping up their CPU power on the MS cloud.