Rate Limit even though Limit not breached

I'm getting the following error when invoking the ChatGPT API:

Retrying langchain.chat_models.openai.ChatOpenAI.completion_with_retry.<locals>._completion_with_retry in 4.0 seconds as it raised RateLimitError: You exceeded your current quota, please check your plan and billing details. For more information on this error, read the docs: https://platform.openai.com/docs/guides/error-codes/api-errors..
Retrying langchain.chat_models.openai.ChatOpenAI.completion_with_retry.<locals>._completion_with_retry in 4.0 seconds as it raised RateLimitError: You exceeded your current quota, please check your plan and billing details. For more information on this error, read the docs: https://platform.openai.com/docs/guides/error-codes/api-errors..
Retrying langchain.chat_models.openai.ChatOpenAI.completion_with_retry.<locals>._completion_with_retry in 4.0 seconds as it raised RateLimitError: You exceeded your current quota, please check your plan and billing details. For more information on this error, read the docs: https://platform.openai.com/docs/guides/error-codes/api-errors..
Retrying langchain.chat_models.openai.ChatOpenAI.completion_with_retry.<locals>._completion_with_retry in 8.0 seconds as it raised RateLimitError: You exceeded your current quota, please check your plan and billing details. For more information on this error, read the docs: https://platform.openai.com/docs/guides/error-codes/api-errors..

Attached is a screenshot of my actual billing usage:

and quota usage:

MODEL          TOKEN LIMITS   REQUEST AND OTHER LIMITS
gpt-3.5-turbo  80,000 TPM     5,000 RPM

My production system has been down since this morning because of this. I'm not sure what else to check or do.

Have you tried a lower model, say GPT-3.5? Are you getting the same error?

We are getting the rate-limit error on gpt-3.5-turbo only.

What has worked for me sometimes is to use a backup account. Sometimes it’s just your account and the best thing to do is create a new one.

For my SaaS that's in prod, I have two API accounts and some logic that switches between them. You could even randomize it.

The feedback above is akin to "turn it off and on again", but sometimes it just works.
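The two-account switchover described above could be sketched like this. The key values, helper names, and the bare `Exception` catch are all illustrative placeholders, not the actual SaaS logic; in practice you would catch only the rate-limit exception from your client library.

```python
import itertools
import random


class KeyRotator:
    """Cycle (or randomize) over a pool of API keys from separate accounts."""

    def __init__(self, keys, randomize=False):
        self.keys = list(keys)
        self.randomize = randomize
        self._cycle = itertools.cycle(self.keys)

    def next_key(self):
        if self.randomize:
            return random.choice(self.keys)
        return next(self._cycle)


def call_with_failover(rotator, call, attempts=None):
    """Try the call with each key in turn; re-raise the last error if all fail."""
    attempts = attempts or len(rotator.keys)
    last_exc = None
    for _ in range(attempts):
        key = rotator.next_key()
        try:
            return call(key)
        except Exception as exc:  # narrow this to RateLimitError in real code
            last_exc = exc
    raise last_exc
```

A caller would wrap its actual API invocation in a function taking the key, e.g. `call_with_failover(KeyRotator(["sk-primary", "sk-backup"]), my_api_call)`.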

The funny thing is that this account is my backup for Azure OpenAI, and the backup is exactly where it's failing.

Rate limits can be quantized, meaning they are enforced over shorter periods of time (e.g. 60,000 requests/minute may be enforced as 1,000 requests/second). Sending short bursts of requests or contexts (prompts+max_tokens) that are too long can lead to rate limit errors, even when you are technically below the rate limit per minute.
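Because of that quantization, a client-side limiter that smooths bursts can help: if a per-minute quota is enforced in roughly one-second slices, keeping your request rate below that slice avoids spurious rate-limit errors even while you are well under the per-minute total. A minimal sliding-window sketch, with illustrative numbers (this is not part of the OpenAI SDK):

```python
import time
from collections import deque


class BurstLimiter:
    """Allow at most `max_calls` requests per `period` seconds.

    Call `wait()` before each API request; it sleeps just long enough
    to keep short bursts under the enforced per-slice limit.
    """

    def __init__(self, max_calls, period=1.0):
        self.max_calls = max_calls
        self.period = period
        self.calls = deque()  # timestamps of recent requests

    def wait(self):
        while True:
            now = time.monotonic()
            # Discard timestamps that have aged out of the window.
            while self.calls and now - self.calls[0] >= self.period:
                self.calls.popleft()
            if len(self.calls) < self.max_calls:
                self.calls.append(now)
                return
            # Sleep until the oldest call leaves the window.
            time.sleep(self.period - (now - self.calls[0]))
```

For a 5,000 RPM limit enforced in one-second slices, something like `BurstLimiter(max_calls=80, period=1.0)` would keep bursts comfortably under the per-second slice.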