Azure OpenAI Assistant API: Retrieve API Fails Due to Rate Limit Exceeded Error

kajal.mishra170 · June 4, 2024, 9:34am

Using the code run = client.beta.threads.runs.retrieve(thread_id=thread_id, run_id=run_id), the response returns:

LastError(code='rate_limit_exceeded', message='Rate limit is exceeded. Try again in 16 seconds.')

Only 2-3 out of 10 prompts are successful.

Could you please help understand when the rate limit exceeded error is thrown and how to avoid it?

Topic		Replies	Views
OpenAI thread run API incredibly slow API gpt-4-turbo , threads , assistants-api	2	349	August 4, 2024
Run.get('status') returns None after ~100 requests API gpt-4	0	468	February 22, 2024
Are there any rate limits when using GPT-4 through the API? API	2	1457	December 15, 2023
Assistant stuck at 'Rate limit is exceeded, please try again in XX seconds" API	1	415	September 6, 2024
RateLimitErrors increased drastically in the last month? API gpt-4 , api	3	634	May 23, 2023