Hey team, is there any formal documentation for the rate limits on the Assistants API? I understand it isn't the same as the standard GPT limits, but I'm not able to build my own rate limiter without this data.
I’m currently resorting to profiling manually, which kinda sucks.
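To give a sense of what I mean, here's roughly the shape of the client-side throttle I'm hand-tuning right now. It's just a sketch: the 100 requests/min ceiling is my own guess, which is exactly the number I'd like to see documented.

```python
import threading
import time
from collections import deque


class SlidingWindowLimiter:
    """Client-side throttle: allow at most `max_requests` calls per `window` seconds.
    The ceiling here is a guess, since the actual Assistants API limit isn't documented.
    """

    def __init__(self, max_requests: int = 100, window: float = 60.0):
        self.max_requests = max_requests
        self.window = window
        self._timestamps = deque()  # monotonic timestamps of recent calls
        self._lock = threading.Lock()

    def acquire(self) -> None:
        """Block until a request slot is free, then record the call."""
        while True:
            with self._lock:
                now = time.monotonic()
                # Drop timestamps that have aged out of the window.
                while self._timestamps and now - self._timestamps[0] >= self.window:
                    self._timestamps.popleft()
                if len(self._timestamps) < self.max_requests:
                    self._timestamps.append(now)
                    return
                # Otherwise wait until the oldest call leaves the window.
                wait = self.window - (now - self._timestamps[0])
            time.sleep(wait)


limiter = SlidingWindowLimiter(max_requests=100, window=60.0)


def guarded_call(fn, *args, **kwargs):
    """Wrap any Assistants API call so it respects the guessed ceiling."""
    limiter.acquire()
    return fn(*args, **kwargs)
```

Without a documented limit I'm literally tuning `max_requests` by trial and error.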
Unfortunately, there isn't any clear-cut documentation of the rate limits for the Assistants API specifically.
The API is rate-limited, but the details are vague, unlike the well-defined per-model limits published for the Chat Completions API. The Assistants endpoints also don't return rate-limit headers, which makes it hard to track usage programmatically.
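To illustrate the difference, here's a minimal sketch assuming the v1 `openai` Python SDK and its `with_raw_response` accessor: the Chat Completions endpoint exposes `x-ratelimit-*` response headers you can read, whereas I haven't seen comparable headers on Assistants calls.

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Chat Completions: the raw response exposes x-ratelimit-* headers,
# so remaining quota can be tracked programmatically.
raw = client.chat.completions.with_raw_response.create(
    model="gpt-4o-mini",  # model name is just an example
    messages=[{"role": "user", "content": "ping"}],
)
for name in (
    "x-ratelimit-limit-requests",
    "x-ratelimit-remaining-requests",
    "x-ratelimit-reset-requests",
):
    print(name, raw.headers.get(name))

completion = raw.parse()  # the usual ChatCompletion object
print(completion.choices[0].message.content)

# The Assistants endpoints don't return comparable rate-limit headers
# (at least none I've seen), so there's nothing equivalent to key a limiter off.
```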
I feel your pain
To make matters worse, the behavior seems inconsistent.
For example, sometimes the error I get back says the rate limit is 200 requests/min; other times it says 1000 requests/min.
On top of that, I've implemented rate limiters that throttle my code to well under 100 requests/min, and I still get the 1000 requests/min error. I'm utterly lost.
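For now the only workaround I've found is to treat any 429 as authoritative and back off with jitter, regardless of which limit the error text claims. A rough sketch, assuming the v1 `openai` Python SDK (where the Assistants endpoints live under `client.beta`) and its `RateLimitError`; the thread id is a placeholder:

```python
import random
import time

from openai import OpenAI, RateLimitError

client = OpenAI()


def call_with_backoff(fn, *args, max_retries: int = 6, base_delay: float = 1.0, **kwargs):
    """Retry an API call on 429s with exponential backoff plus jitter.

    Used because the limit quoted in the error text doesn't match observed behavior,
    so the only reliable signal is the 429 itself.
    """
    for attempt in range(max_retries):
        try:
            return fn(*args, **kwargs)
        except RateLimitError:
            # Back off 1s, 2s, 4s, ... plus up to 1s of jitter.
            delay = base_delay * (2 ** attempt) + random.random()
            time.sleep(delay)
    # Final attempt; let any error propagate to the caller.
    return fn(*args, **kwargs)


# Example: list runs on a thread without hammering the endpoint after a 429.
# "thread_abc123" is a placeholder id, not a real thread.
runs = call_with_backoff(client.beta.threads.runs.list, thread_id="thread_abc123")
```

It works, but it's a blunt instrument compared to knowing the actual limits.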
Honestly, OpenAI team, we'd just love some documentation here, any kind at all.
100% correct. There has to be some explanation of how usage is counted, even if it isn't meant for public release. If there isn't one, then I suspect the token usage the models report is hallucinated too.