Level 4 - hitting rate limit. How to get temp exception?

dimitris · December 11, 2025, 5:34pm

Good day all

We are launching our new product and need to vectorize a lot of data. We’ve chosen to use the small vectorizing model. Unfortunately, we are hitting the quite low vectorizing limit and this is causing a bottleneck for our org.

Is there any way to get an exception? Otherwise we might be looking at a 2 month delay

Thank you

D

_j · December 11, 2025, 6:03pm

You’ve got $0.10 per minute in spending capped by the rate limiter (or less depending on the estimated tokens), or perhaps $100 per day if you are extremely persistent. In comparison to other models (like Sora 2 where you can spend hundreds per minute), it does seem paltry.

The batches API is a separate pool of rate limit, and adds 100 “minutes worth” to that. Then, if the batch jobs are done quickly and exit the pending queue, that should mean you can achieve more than 500M tokens per 24h promise. You can see that tier-5 is a significant jump in batch queue - still $40/max/queue after discount (over the $5 ensured additional spend you might have per day otherwise in batch with tier-4.)

Thus, I extrapolate that your total desired spend on 3-small embeddings is $6000 if you and I have done our math right. You can pay the additional needed to reach $1000 total if over 30 days since the first payment and graduate to tier 5.

There is no more “request exception” in the platform site any more. You’d have to go to help.openai.com, ensuring login there against your organization owner account, and send a message that explains your need to send OpenAI a lot of money for upfront AI usage to a particular model.

sales@openai.com is another avenue, but you’ll likely only hear back if they prequalify your interest and a Googling as “enterprise partner” material. Also Microsoft Azure services.

Note: this is a significant investment that will only work against a hosted model now two years old…

Topic		Replies	Views
No response from OpenAI re: Rate limit increases for months API api-rate-increase , rate-limit	16	1972	May 22, 2024
OpenAI super high limit for business account Community api	4	779	March 12, 2024
Inquiry About Maximum Rate Limit for GPT-3.5-turbo-16k Model API api-rate-increase , rate-limit	7	1167	November 1, 2023
NeedHelp: Increase usage limit API api-rate-increase , api-billing	5	1338	December 16, 2023
Tier Upgrade Problem. Anyone faced something similar? Bugs	3	343	September 25, 2024

Level 4 - hitting rate limit. How to get temp exception?

Related topics