You can use this client, which is implemented with asyncio and httpx.
It supports fine-grained connect/read timeout settings and connection reuse.
import os

from httpx import Timeout
from openai_async_client import AsyncCreate, ChatCompletionRequest, Message, TextCompletionRequest

create = AsyncCreate(api_key=os.environ["OPENAI_API_KEY"])

# Chat completion: 1s connect timeout, 10s read timeout, up to 3 retries.
messages = [
    Message(
        role="user",
        content="ChatGPT, give a brief overview of Pride and Prejudice by Jane Austen.",
    )
]
response = create.completion(
    ChatCompletionRequest(prompt=messages),
    client_timeout=Timeout(1.0, read=10.0),
    retries=3,
)

# Text completion; AsyncCreate() with no argument assumes OPENAI_API_KEY is set.
create = AsyncCreate()
response = create.completion(
    TextCompletionRequest(prompt="DaVinci, give a brief overview of Moby Dick by Herman Melville.")
)
I'd just like to add that a retry/backoff library is also a great option. If a timeout or some intermittent error occurs, it will automatically retry with dynamically increasing intervals.
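For example, here is a minimal sketch using the tenacity library together with the pre-1.0 openai SDK (the function name ask, the model, and the backoff bounds are just illustrative choices; the backoff library works similarly):

import openai
from tenacity import retry, stop_after_attempt, wait_exponential

# Retry on failure with exponential backoff (1s, 2s, 4s, ... capped at 30s),
# giving up and re-raising the last error after 5 attempts.
@retry(
    wait=wait_exponential(multiplier=1, min=1, max=30),
    stop=stop_after_attempt(5),
    reraise=True,
)
def ask(prompt: str) -> str:
    response = openai.ChatCompletion.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": prompt}],
    )
    return response["choices"][0]["message"]["content"]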
Yes, request_timeout is very important. Most answers explain how to write a retry decorator, but retries alone cannot solve the problem of slow responses; setting this parameter helps a lot.
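For instance, with the pre-1.0 openai SDK, which accepts request_timeout as a per-call keyword argument (a rough sketch; the model and prompt are placeholders):

import openai

# Fail fast instead of hanging on a slow response; combine this with a
# retry decorator so timed-out calls are reattempted.
response = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "Hello"}],
    request_timeout=10,  # seconds
)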
Meanwhile, using a parallel method helps when you have not yet reached the RPM limit of your account, e.g. pool.apply_async(), as sketched below.
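A minimal sketch with multiprocessing.pool.ThreadPool, assuming ask is a retry-wrapped request function like the hypothetical one above:

from multiprocessing.pool import ThreadPool

prompts = ["Summarize Hamlet.", "Summarize Macbeth.", "Summarize Othello."]

# Issue requests in parallel; size the pool so you stay under your RPM limit.
with ThreadPool(processes=4) as pool:
    async_results = [pool.apply_async(ask, (p,)) for p in prompts]
    answers = [r.get() for r in async_results]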
In conclusion, a retry decorator + the request_timeout parameter + a parallel method will accelerate your ChatGPT application.