Token per minute rate limit for GPT4 issues

_j · September 8, 2023, 5:24pm

Yes, max tokens are also counted and a single input denied if it comes to over the limit. You can get a rate limit without any generation just by specifying max_tokens = 5000 and n=100 (500,000 of 180,000 for 3.5-16k).

The rate limit endpoint calculation is also just a guess based on characters; it doesn’t actually tokenize the input.

Topic		Replies	Views
Inputs tokens limit, data extraction API gpt-4 , gpt-35-turbo , api , token , rate-limit	2	6169	February 3, 2024
Rate limit reached for 10KTPM-200RPM API gpt-4 , gpt-35-turbo	10	6437	October 24, 2023
Please explain the Tokens per minute metric API	1	5387	January 21, 2024
How do I get token limit per MINUTES and when it will reset? API gpt-4	2	2317	December 30, 2023
Reproducable GPT4 Rate limit bug Bugs	5	1050	November 8, 2023

Token per minute rate limit for GPT4 issues

Related topics