Configuring timeout for ChatCompletion Python

AgusPG · March 20, 2023, 9:19am

In addition to this: I’d also recommend having some sort of fallback strategy. Sometimes a model is completely off but the other ones are working seamlessly. Retrying the same model over and over will not help, but falling back to a different (usually worse) model will do.

In my case, a generic service checks out the model’s health by pinging them every minute, and updates this models’ health in a database. When any of the other services want to call OpenAI’s API using a particular model, they firstly retrieve the model’s health from the database. If it’s ok, they call it with a retry mechanism. If not, they call the best one available at that precise moment (also with a retry mechanism).

Topic		Replies	Views
Frequent API timeout errors recently API	39	48857	December 12, 2023
Timeout for OpenAI chat completion in Python API api , python	6	27383	December 16, 2023
Recommended way to limit the amount of time a Python ChatCompletion.create() runs API gpt-4	8	2451	September 15, 2023
Setting request_timeout in openai v1.2.2 API	3	16602	November 10, 2023
Timeout not honored in Python API? API	11	4295	June 8, 2023

Configuring timeout for ChatCompletion Python

Related topics