Question about the new Priority processing

wxp041985 · July 25, 2025, 5:00am

Reading through the priority process document, had one question regarding the capacity, can anyone help clarify this?

Assuming if my requests are always within ramp rate limits, and the traffic keep growing, will OpenAI always guarantee that it will have enough capacity to handle my requests in Priority tier? or my requests still might be downgraded to standard tier when the overall Priority tier capacity is not enough?

_j · July 25, 2025, 7:14am

This statement about the enterprise tier program:

If (a) Priority processing performance is degraded AND (b) a customer’s traffic is ramping too quickly, then some Priority requests may be downgraded to Standard processing instead.

It does not say “OR performance is degraded”. If you literally read the statement, then the condition “some Priority requests may be downgraded” cannot be reached by only degraded performance and not also an organization over-ramp (to hint at provisioning needs), and thus would still fall under SLA (which if not met is all of “sorry, our bad, contact sales rep”).

The excess rate is not by day or something reasonable, it is by continuous use by 15 minutes and is violated if you increase by 50% in 15 minutes (vs unmentioned metering period). Starting at 100k TPM to consider to drop you out. So guaranteed good for the speed of barely a few users with large input before this ramping factors in. Comparable to about 3 scale units.

So this does not tolerate any burstyness or variance and you would have to have many users, but the penalty is then just a cheaper standard request.

Overall: this is a price/profit increase for OpenAI having the same compute resources.

vb · July 25, 2025, 8:56am

Hi and welcome to the community!

With regards to this question:

I expect you are mostly interested in edge cases, like a constantly growing demand for a extended period of time. I suggest to reach out to sales@openai.com or your account manager directly because the general marketing page doesn’t have all the answers.

In general I read it as: if everything else stays the same and the ramp limits are not hit, priority processing will commence as expected.

Topic		Replies	Views
Inquiry About Maximum Rate Limit for GPT-3.5-turbo-16k Model API api-rate-increase , rate-limit	7	1139	November 1, 2023
OpenAI "Priority" Tier SLA – No Way to Measure Latency? API api , api-billing	4	893	December 5, 2025
30K Tokens Per Minute limit vs 128K+ Context Models – Is Long-Context Usage Actually Possible via API? API	3	196	November 23, 2025
Access to API Scale Tier for Production Deployment API api , enterprise	8	325	May 22, 2025
Can I use GPT4 preview in production in Tier 4? Documentation unclear API gpt4	1	786	November 28, 2023

Question about the new Priority processing

Related topics