Regarding rate limit in multi model

jake8131 · October 25, 2023, 9:19am

I have a question about rate limits.

Let's suppose I am on Tier 1, which looks like this:
$5 paid, $100, 500 RPM, 10K RPD, 40K TPM (GPT-3.5), 10K TPM (GPT-4)

If GPT-4 hits the limit, what happens to GPT-3.5?

When GPT-4 reaches its rate limit, does GPT-3.5 also get rate-limited automatically? Or do the rate limits for these two models operate independently, so that even if one is rate-limited, the other continues to function normally?

b0zal · October 25, 2023, 10:23am

Every model has its own rate limit.

Therefore, you can still use GPT-3.5 even if you’ve reached the limit for GPT-4.
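
One way to check how much budget remains for a given model is to read the rate-limit headers that come back with each API response. The snippet below is a minimal sketch, assuming you already have the response headers as a plain dictionary; the helper name `remaining_budget` is made up for illustration, and how you obtain the headers depends on your HTTP client.

```python
def remaining_budget(headers):
    """Pull the remaining request/token budget out of response headers.

    `headers` is assumed to be a dict-like mapping of header names to
    string values. Missing headers are reported as None.
    """
    # Header names are case-insensitive, so normalize before lookup.
    lowered = {key.lower(): value for key, value in headers.items()}

    def as_int(name):
        value = lowered.get(name)
        return int(value) if value is not None else None

    return {
        "remaining_requests": as_int("x-ratelimit-remaining-requests"),
        "remaining_tokens": as_int("x-ratelimit-remaining-tokens"),
    }
```

Comparing the values returned for a GPT-4 call and a GPT-3.5 call made around the same time shows whether the two budgets move together or separately on your account.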

_j · October 25, 2023, 10:48am

That’s actually not the case. If you go well over your rate limit (for example, with a set of parallel GPT-4 calls where you gave no max_tokens for the limiter to estimate), you’ll have locked yourself out of chat models until the “percentage over” resets, which can carry over multiple minutes.

You can blast off a few dollars worth of long completion GPT-4 all at once and verify for yourself. Or you can just believe me without me needing to get the corroborating evidence off the forum.
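
To stay under the limit in the first place, you can budget on the client side. This is a rough sketch, assuming the limiter counts the prompt tokens plus the max_tokens you request; `assumed_default` is a hypothetical placeholder for whatever gets counted when max_tokens is omitted, since the thread does not say what that value is.

```python
def estimated_request_tokens(prompt_tokens, max_tokens=None, assumed_default=4096):
    """Estimate how many tokens one request counts against the TPM limit.

    Assumption: the limiter reserves prompt tokens plus max_tokens.
    When max_tokens is not given, fall back to a guessed reservation.
    """
    reserve = max_tokens if max_tokens is not None else assumed_default
    return prompt_tokens + reserve


def max_parallel_calls(tpm_limit, prompt_tokens, max_tokens=None, assumed_default=4096):
    """How many such requests fit into one minute's token budget."""
    per_call = estimated_request_tokens(prompt_tokens, max_tokens, assumed_default)
    return tpm_limit // per_call
```

With the 10K TPM GPT-4 limit from the question, `max_parallel_calls(10_000, 500, max_tokens=500)` gives 10, while leaving max_tokens unset under the guessed default gives only 2.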

b0zal · October 25, 2023, 10:50am

My software calls the LLM API methods, which don’t necessarily require a maximum token count. So, in this scenario, usage largely depends on the input we provide.
