TPM Rate limited exceeded - why?

frank.beyer · December 31, 2024, 8:58am

Situation:

I have read many threads here on this topic, but still the following basic question remains unanswered for me (and I believe for many other users too).
My token usage during the course of December has reached 10Million tokens on gpt-4o-mini and I am getting the message “rate_limited_exceeded”. Consequently, I can’t send requests anymore even for requests with a low number of total tokens.
My limit (I am at Tier4) is indeed 10,000,000 TPM, but that applies on tokens per minute (TPM) according to the documentation.
By the way, my total costs in Dec is only about $13.

Question:

Why I am exceeding a limit and which one? Again, it is a per minute limit, but appears to be a limit for the entire month.
If I would have indeed exceeded any TPM limit, it should be freed up shortly, let’s say at least within an hour, but it does not. Why?

vb · December 31, 2024, 12:46pm

Hi and welcome to the community!

Have you looked into the rate limit headers to learn more about the situation?

This is a relatively new feature. I expect there are not many old topics referencing it as a debugging tool.

arata · December 31, 2024, 5:20pm

Check your current account balance. Out of funds?

It might be a rate limit error type, but the message text may continue “check your plan and billing details…”

Auto-recharge is not working, likely disabled due to even more severe problems of the function making unneeded charges to a card.

Topic		Replies	Views
Error Code: 429 Rate Limit Differs from Documentation Bugs chatgpt	3	219	February 24, 2025
Rate_limit_exceeded error when we only do one transaction at a time API rate-limit , gpt-4o-mini	7	354	November 19, 2024
Rate limit issue, very confused with results API	4	2736	December 22, 2023
I don't know where where my tokens are being used. I think it is wrong API gpt-4 , api , gpt-4-turbo	12	1982	December 10, 2023
Rate limiting but I've run nothing... and I'm getting charged - what's up? API assistants-api	3	1204	February 8, 2024