Question Regarding TPM Limits Across Separate Environment API Keys

paseli.koba · December 20, 2024, 5:53am

Hello everyone,

I’m currently using separate API keys for my production and staging environments. I’d like to clarify how TPM (tokens per minute) limits are applied. Are TPM limits accounted for independently on a per-API-key basis, or are they shared across all keys under the same account?

In other words, if my staging environment hits its TPM limit, could this also impact my production environment’s access (even though it uses a different API key)? Any insights or guidance from those who have encountered a similar situation would be greatly appreciated.

Thank you!

ーーーーーーー

現在、弊社ではOpenAI APIを運用する際に、本番環境とステージング環境で別々のAPIキーを発行して利用しています。

この場合、TPM（トークン・パー・ミニット）の利用制限は環境ごと（つまりAPIキーごと）に独立してカウントされるのか、それとも同一アカウント内で共有される仕組みになっているのでしょうか？

具体的には、ステージング環境でのテスト中にTPM制限に達した場合、その影響で本番環境側のAPIキー利用にも制限が及ぶ可能性があるのか確認したいです。

ご存知の方がいらっしゃいましたら、アドバイスをいただけますと幸いです。

よろしくお願いいたします。

_j · December 20, 2024, 6:31am

Overall rate limits are on the organization level.

Within individual projects, you can place further rate limits yourself. You might keep an app developer from going to $1000 per hour when five requests per minute are adequate.

There are different groups of rate limits by model. For example, embeddings is a different pool than any completion model. You can look at your rate limit page in the platform account, and infer the shared rate pools of those models that have the same limits.

Hitting a rate limit will produce an API error. If it is the organization limit for a model class, everyone will get the error until the rate algorithm “refills”.

paseli.koba · December 20, 2024, 9:36am

Thank you for clarifying.

From your explanation, I understand that the TPM and related rate limits are indeed applied at the organization level, not just per API key.
This means that if our staging environment hits a rate limit, it will affect production as well.
We’ll keep this in mind and plan our usage accordingly.
Thanks again for your help!

ーーーーーーーーー

ご説明ありがとうございます。

おかげさまで、TPMを含むレートリミットはAPIキー単位ではなく、組織単位で適用されることがわかりました。つまり、ステージング環境で上限に達した場合でも本番環境にも影響が出る可能性があるということですね。

今後はこの点を踏まえた上で利用計画を練りたいと思います。

改めてご助言いただき、ありがとうございました！

_j · December 20, 2024, 10:08am

Another strategy for true isolation is to use a second organization.

“Organization” by name is something that is contained within accounts, but can also be shared with other accounts, or another account can even be made “owner” instead of “reader” for full management (and even takeover).

Each organization has its own billing, and must individually reach payment tier qualifications to have higher usage limits.

Some older accounts had a second “personal” organization added that could be employed for this purpose. At one point in time, you could also request an organization be added by making a request to OpenAI, but I don’t know the current policy. If another account was made owner and took your organization, removing you, then there would be no organization within the original account for API, necessitating requesting a new organization.

Topic		Replies	Views
Multiple Apps with Separate API Keys Under One OpenAI Account: Rate Limits and Usage Tracking API api	4	190	February 18, 2025
Should I create a separate organization for a production service vs development environment? API	5	981	September 3, 2024
Why changing organization in the header affects the Open AI api response speed? API api	7	1852	November 2, 2023
Different gpt-4 level models to mitigate rate limiting issues API	6	1062	September 13, 2023
Is there any problem if i exceed TPM limit? API gpt-4	5	1842	May 20, 2024

Question Regarding TPM Limits Across Separate Environment API Keys

Related topics