Rate Limit Advice - update

EricGT · February 1, 2024, 10:32pm

Rate limits can be quantized, meaning they are enforced over shorter periods of time (e.g. 60,000 requests/minute may be enforced as 1,000 requests/second). Sending short bursts of requests or contexts (prompts+max_tokens) that are too long can lead to rate limit errors, even when you are technically below the rate limit per minute.

OpenAI FAQ - How can I solve 429: ‘Too Many Requests’ errors?

As unsuccessful requests contribute to your per-minute limit, continuously resending a request won’t work.

PaulBellow · February 1, 2024, 11:00pm

Good to know. Thanks for passing this along. So many things to keep track of these days, it can be easy to miss something like this!

EricGT · February 1, 2024, 11:04pm

The thanks go to @anon22939549. I just had to scan them as they were created.

PaulBellow · February 1, 2024, 11:07pm

Even that takes time!

"… 195 replies… "

Thanks to all, though.

anon22939549 · February 2, 2024, 2:05am

Things should, in theory, be fixed now so there won’t be trivial updates posted. I just hope Logan gets back and gives the go ahead for a new sub-category and an API key for it.

Topic		Replies	Views
Error: 429 Too Many Requests API	56	14110	December 2, 2023
Currently getting a deluge of 429s in a row API	15	1818	April 16, 2024
Getting 429 errors without hitting limits API	11	3053	December 18, 2023
Hitting Rate Limit with small group of Users? API api-rate-increase	14	6289	January 20, 2024
429 Error on Free API key API	5	2792	December 17, 2023

Rate Limit Advice - update

Related topics