Client-side rate limiting

harjot.gill · October 27, 2023, 2:43am

This is regarding the rate limiter OpenAI has deployed within CloudFlare. They don’t run a tokenizer (e.g. cl100k_base) while calculating rate limits. Tokenization happens after the rate limits are enforced. If you are using actual token calculation to figure out rate limits then you will be quite off.

PS: We tried working with tokenizer (tiktoken) and our limit estimates were quite off. With character_count/4 that is pretty much exact.

Topic		Replies	Views
Hitting Rate Limit with small group of Users? API api-rate-increase	14	6316	January 20, 2024
Rate limit error Tier 2 Account Rate Limit Issues with gpt-3.5-turbo API gpt-35-turbo , api , rate-limit , api-billing , api-rate-limits	6	7518	January 2, 2024
Rate limit reached for 10KTPM-200RPM API gpt-4 , gpt-35-turbo	10	6159	October 24, 2023
Rate Limit Error: Minute and Daily Limit API gpt-4 , api	8	5673	January 9, 2024
I don't know where where my tokens are being used. I think it is wrong API gpt-4 , api , gpt-4-turbo	12	2011	December 10, 2023

Client-side rate limiting

Related topics