Getting rate limit error that specifies incorrect rate limit

product1 · June 27, 2024, 4:06pm

I am in usage tier 5, and the limits page in settings gives my RPM limit on gpt-3.5-turbo as 10k. However, I am getting this error message fairly frequently:

{
  "error": {
    "message": "You've exceeded the 200 request/min rate limit, please slow down and try again.",
    "type": "invalid_request_error",
    "param": null,
    "code": "rate_limit_exceeded"
  }
}

This error can occur regardless of which threads endpoint I call, but in this example the endpoint was:
/v1/threads/runs

Is there something different with the threads API or is this a case of my rate limit not being respected correctly?

_j · June 27, 2024, 4:25pm

The Assistants API has an undocumented rate limit on the API calls themselves, perhaps to keep it “beta” for now. What you report is an increase from the long-standing limit of 60 requests per minute, which could be exhausted just by polling for a run to complete.
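
To put the polling point in numbers: checking a single run once per second is already 60 calls per minute. One way to spend fewer calls is to back off between checks. Below is a minimal sketch of that idea, not an official client. `fetch_status` is a hypothetical callable you would supply (for example, a wrapper around your own "retrieve run" request), and the set of terminal states is an assumption you should check against the statuses your runs actually return.

```python
import time
from typing import Callable

# Assumption: statuses after which a run will not change again.
TERMINAL_STATES = {"completed", "failed", "cancelled", "expired"}


def poll_run(
    fetch_status: Callable[[], str],  # hypothetical: returns the run's current status
    initial_delay: float = 1.0,
    max_delay: float = 8.0,
    timeout: float = 120.0,
    sleep: Callable[[float], None] = time.sleep,
) -> tuple[str, int]:
    """Poll until the run reaches a terminal state, doubling the delay each time.

    Returns the final status and the number of status calls that were made.
    """
    delay = initial_delay
    waited = 0.0
    calls = 0
    while True:
        status = fetch_status()
        calls += 1
        if status in TERMINAL_STATES:
            return status, calls
        if waited >= timeout:
            raise TimeoutError(f"run still '{status}' after {waited:.0f}s")
        sleep(delay)
        waited += delay
        # Exponential backoff, capped so long runs are still checked regularly.
        delay = min(delay * 2, max_delay)
```

With the defaults, a run that takes about a minute costs roughly ten status calls instead of sixty, at the price of noticing completion up to `max_delay` seconds late.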

product1 · June 27, 2024, 4:48pm

Does this rate limit apply per API key or per assistant?

_j · June 28, 2024, 12:27pm

It is an organization-wide rate limit, and it applies to the calls to the endpoint, not their contents.
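
Since the limit counts calls across the whole organization, one option is to route every Assistants call through a single client-side limiter. The following is a rough sketch only: the class name and its methods are made up for illustration, the default of 200 calls per 60 seconds is taken from the error message above, and it only coordinates calls within one process. Several processes or machines sharing the same organization would need a shared store instead.

```python
import time
from collections import deque
from typing import Callable


class SlidingWindowLimiter:
    """Hypothetical helper: allow at most max_calls per rolling window (seconds)."""

    def __init__(
        self,
        max_calls: int = 200,   # assumption: the figure from the error message
        window: float = 60.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._max_calls = max_calls
        self._window = window
        self._clock = clock
        self._stamps: deque[float] = deque()  # timestamps of recent calls

    def _evict(self, now: float) -> None:
        # Drop timestamps that have fallen out of the rolling window.
        while self._stamps and now - self._stamps[0] >= self._window:
            self._stamps.popleft()

    def try_acquire(self) -> bool:
        """Record a call and return True if there is room, else return False."""
        now = self._clock()
        self._evict(now)
        if len(self._stamps) < self._max_calls:
            self._stamps.append(now)
            return True
        return False

    def wait_time(self) -> float:
        """Seconds until the next call would be allowed (0.0 if allowed now)."""
        now = self._clock()
        self._evict(now)
        if len(self._stamps) < self._max_calls:
            return 0.0
        return self._window - (now - self._stamps[0])
```

Before each request you would call `try_acquire()`, and if it returns `False`, sleep for `wait_time()` and try again. It is probably wise to set `max_calls` somewhat below the real limit, because other callers in the organization draw from the same budget.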

product1 · June 28, 2024, 1:02pm

Does this same rate limit exist for the chat completions API?

_j · June 28, 2024, 1:27pm

No. For chat completions, I do not know of any practical limit at which you start to get cut off, other than the model-based request limits and the limits on encoded tokens.
