Access to API Scale Tier for Production Deployment

Hello,

We represent a company that has developed an AI assistant for specific users. As we prepare to move this service into production, we are looking to improve and guarantee response speed for our users.

We came across the API Scale Tier and are interested in enabling it for our project. However, we could not find clear instructions on how to activate this tier or what the associated costs are.

Could someone from the OpenAI team clarify:

  • How we can request access to the Scale Tier?
  • What the pricing model is?
  • Whether the tier includes SLAs or latency guarantees?

Thank you in advance for your help.

The page answers all your presale questions.

  1. You request access by first being an Enterprise customer. Expect that it is a level beyond the $200k monthly cap that can be reached by API tiers. The top has a link.
  2. The pricing is by guaranteed input output units. For example, one unit of 30k input tokens/minute (equivalent to the rate limit of tier 1, btw), would be $3300 a month, subject to whatever wheeling and dealing your account manager can sell you. That’s a starting point to one-half of the minimum purchase needed.
  3. You will see the word SLA right in the page.
1 Like

Hi @Naimxyz

Welcome to the dev community.

You would need to fill a form to contact sales to learn more about provisioning Scale Tier on your org.

https://openai.com/contact-sales/

Hello,

I have already done this task, but I get an automatic answer " Use chatGPT Team " but the team version doesn’t give access to custom Scale Tier.

I’m going to migrate to Microsoft Azure, since it’s the only way to get a dedicated endpoint or at least provisioning TPU to reduce latency.

Thank you !

I can confirm that contacting sales is the only valid way to obtain the Scale Tier.

This is just a hint, as many enterprise customers are also interested in ChatGPT

You should still be able to submit the form regardless of this hint.

1 Like

No, I was genuinely referring to their response.

Here is the email I received:

1 Like

I understand the confusion. The Scale tier is only available to Enterprise accounts, and based on your inquiry, your company doesn’t meet the criteria for an Enterprise upgrade. The best alternative is a Team account, but this tier doesn’t include Scale access.

@sps has flagged this to the team, and they’ll coordinate with support and sales to improve the process going forward.

It’s probably not the answer you were hoping for, but I hope it clarifies things!

1 Like

Just to confirm @Naimxyz

You selected API for Enterprise and still got that email?

@sps Yes !
@vb Argh, that’s the response I was unfortunately expecting. It’s really frustrating to have to migrate to Azure just to get a dedicated endpoint or even provisioned TPUs.

I really hope OpenAI will offer a solution for Team users, or ideally everyone, that allows for ultra-low latency without requiring a $200k/month spending.

Thank you for your answers :slight_smile:

2 Likes