Completion Endpoint Randomly Freezes

derek8bai · March 20, 2023, 12:05am

Does anyone else run into the issue where “openai.Completion.create” sometimes just hangs and never returns anything?

ruby_coder · March 20, 2023, 2:06am

Not really. However, performance is model dependent.

The completion API (in general) performs much faster than the new chat completion API.

Maybe you can post your code so we can see it?

Thanks

derek8bai · March 20, 2023, 2:43am

Sure! I’m just using this. But sometimes (maybe 10% of the time) it just never returns a response and it times out.

response = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[
        {"role": "system",
         "content": "You are an intelligent, helpful, good teacher explaining to a student. Answer as concisely as possible."},
        {"role": "user", "content": f"{questionText}"},
    ],
    temperature=0,
    max_tokens=150,
    top_p=0.5,
    stream=True,
    frequency_penalty=0,
    presence_penalty=0
)

ruby_coder · March 20, 2023, 2:51am

Well, we learned a lot from your code, @derek8bai

You are talking about the chat completion endpoint, not the completion endpoint, and you are using the gpt-3.5-turbo model.

It is well known that these new chat completion models are under heavy load from all the new users pounding on them, so they are very slow.

HTH

RabbitHam · March 20, 2023, 9:31am

I’m having the exact same issue as @derek8bai. I’m also using the ChatCompletions endpoint with gpt-3.5-turbo with it hanging randomly. I’ve implemented a retry with exponential backoff, and it still doesn’t want to work. This is impacting the public image of our service. What other solutions have people found?
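For readers comparing notes on the retry approach: backoff on its own cannot help when a call never returns, because the retry logic only runs once an exception is raised. Below is a minimal sketch, assuming a hypothetical zero-argument callable `make_request` that stands in for the API call. It puts a deadline on each attempt and then backs off. The names `call_with_deadline`, `retry_with_backoff`, and `CallTimedOut` are illustrative and not part of any library.

```python
import random
import threading
import time


class CallTimedOut(Exception):
    """Raised when one attempt exceeds its deadline."""


def call_with_deadline(fn, timeout_s):
    """Run fn() in a daemon thread and stop waiting after timeout_s seconds.

    The worker thread is abandoned, not killed, so a truly hung call keeps
    running in the background until the process exits.
    """
    outcome = {}

    def worker():
        try:
            outcome["value"] = fn()
        except Exception as exc:  # hand the error back to the caller
            outcome["error"] = exc

    thread = threading.Thread(target=worker, daemon=True)
    thread.start()
    thread.join(timeout_s)
    if thread.is_alive():
        raise CallTimedOut(f"no response within {timeout_s}s")
    if "error" in outcome:
        raise outcome["error"]
    return outcome["value"]


def retry_with_backoff(make_request, timeout_s=30.0, max_attempts=5,
                       base_delay_s=1.0, sleep=time.sleep):
    """Retry make_request, treating a missed deadline like any other failure."""
    for attempt in range(max_attempts):
        try:
            return call_with_deadline(make_request, timeout_s)
        except Exception:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the last error
            # Exponential backoff with jitter: base delays of 1s, 2s, 4s, ...
            delay = base_delay_s * (2 ** attempt)
            sleep(delay + random.uniform(0, delay / 2))
```

Where the client library exposes its own request timeout, that is likely the better option, since it closes the connection instead of leaving a thread behind. The thread wrapper here is only a fallback for calls that offer no such setting.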

marc.torsoc · June 5, 2023, 2:13pm

In my case, a time.sleep(2) solved the issue. I cannot say so with 100% certainty, but I was running a loop that sent 100 prompts, and it would always hang at some point, very rarely reaching the 50th iteration. After adding the sleep command, the loop now runs to completion.
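A fixed sleep like this is effectively client-side pacing. Here is a minimal sketch of the same idea that sleeps only for whatever is left of a minimum interval between calls. `send` is a hypothetical function standing in for the API call, and the 2-second default simply mirrors the value above; it is not a documented limit.

```python
import time


def run_paced(prompts, send, min_interval_s=2.0,
              clock=time.monotonic, sleep=time.sleep):
    """Call send(prompt) for each prompt, starting calls at least
    min_interval_s seconds apart."""
    results = []
    last_start = None
    for prompt in prompts:
        if last_start is not None:
            # Sleep only for the part of the interval not already used up
            # by the previous call.
            remaining = min_interval_s - (clock() - last_start)
            if remaining > 0:
                sleep(remaining)
        last_start = clock()
        results.append(send(prompt))
    return results
```

The `clock` and `sleep` parameters exist only so the pacing can be exercised without real waiting. In normal use they can be left at their defaults.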

analysis · August 31, 2023, 11:02am

I have stumbled upon the exact same issue.
I am trying to make calls concurrently, and it always hangs and never exits.
Has anyone found a solution to this?
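One common pattern for concurrent calls is to cap how many are in flight and to put a timeout on each one, so a single stalled request cannot keep the whole program from exiting. The sketch below is a minimal example under stated assumptions: `fetch` is a hypothetical coroutine function standing in for an async API call, and `run_bounded` is an illustrative name. It assumes `fetch` is genuinely asynchronous. A blocking call inside a coroutine would stall the event loop and defeat the timeout.

```python
import asyncio


async def run_bounded(items, fetch, max_concurrency=5, timeout_s=30.0):
    """Await fetch(item) for every item with bounded concurrency.

    Each call is cancelled if it takes longer than timeout_s seconds, and
    its slot in the result list is filled with None.
    """
    semaphore = asyncio.Semaphore(max_concurrency)

    async def one(item):
        async with semaphore:  # at most max_concurrency calls at once
            try:
                return await asyncio.wait_for(fetch(item), timeout_s)
            except asyncio.TimeoutError:
                return None  # the caller decides whether to retry

    # gather returns results in the same order as the input items
    return await asyncio.gather(*(one(item) for item in items))
```

A call would look like `asyncio.run(run_bounded(prompts, fetch))`. Items that come back as `None` can then be retried or logged.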

_j · August 31, 2023, 11:33am

Your issue is likely unrelated to this topic. It is probably a coding problem, in code you haven’t posted.

Big code example on running parallel tasks.

https://github.com/openai/openai-cookbook/blob/main/examples/api_request_parallel_processor.py

"""
API REQUEST PARALLEL PROCESSOR

Using the OpenAI API to process lots of text quickly takes some care.
If you trickle in a million API requests one by one, they'll take days to complete.
If you flood a million API requests in parallel, they'll exceed the rate limits and fail with errors.
To maximize throughput, parallel requests need to be throttled to stay under rate limits.

This script parallelizes requests to the OpenAI API while throttling to stay under rate limits.

Features:
- Streams requests from file, to avoid running out of memory for giant jobs
- Makes requests concurrently, to maximize throughput
- Throttles request and token usage, to stay under rate limits
- Retries failed requests up to {max_attempts} times, to avoid missing data
- Logs errors, to diagnose problems with requests

Example command to call script:
```
python examples/api_request_parallel_processor.py \

This file has been truncated.

Are you using await asyncio.sleep() so that other tasks can actually run?

evan.lesmez · September 29, 2023, 6:54pm

I have the same problem and, unlike @_j, I don’t think it is a “coding problem”.
I made a simple translation prompt and tried increasing the number of text inputs to translate.
On the 46th request, the gpt-3.5-turbo chat completion hung for 10 minutes before timing out.

The OpenAI rate limit documentation states that I get 90000 tokens per minute and 3500 requests per minute.
The 46 requests, and likely only 1000-2000 tokens, used for my test should not come anywhere near those limits.
I think something else is going on.

pavel · October 26, 2023, 2:39am

I have encountered the same problem: ChatCompletion just gets stuck with some content and that’s it. I have not exceeded the rate limits.
