Is max token limit per endpoint or model?

mikeyshiran · June 21, 2023, 7:06am

From the docs:
It is important to note that the rate limit can be hit by either option depending on what occurs first. For example, you might send 20 requests with only 100 tokens to the Edit endpoint and that would fill your limit, even if you did not send 150k tokens within those 20 requests.

But since different models have different token rate limits (16k has x2 the token limit), will I effectively be able to use 270k tokens per minute for 3.5 (90k for 3.5 and 180k for 3.5-16k)?

What about number of requests? (I assume it’s per endpoint, so shared, but still)

Thanks,
Miki

jochenschultz · June 29, 2023, 9:05pm

As far as I know it happens even randomly. Gonna do some tests to get the limits on some tonight.

Topic		Replies	Views
Regarding rate limit in multi model API	3	919	October 25, 2023
Maximum token allowed for chat gpt model gpt 3.5 turbo API chatgpt	3	2521	February 15, 2024
Token per minute rate limit for GPT4 issues API rate-limit	7	9915	December 22, 2023
Tokens limit gpt-3.5-turbo-0125 API token , gpt-0125	1	3456	February 15, 2024
Please tell me the maximum number of tokens for GPT-3.5-turbo-1106 API api , token , pricing	4	18217	January 15, 2024

Is max token limit per endpoint or model?

Related topics