How to handle expected large rate limits

unubar · July 21, 2023, 11:53pm

Hi,

I am developing a web app that will be used by a large number of users. The web app will be making calls to OpenAI for each user’s interaction. How do I avoid hitting rate limits? Is there a way to increase it or is this a case where I should move to using Azure OpenAI?

Regards,
Ubaidullah Nubar.

_j · July 22, 2023, 12:41am

If you anticipate usage bursts of more requests or more tokens than your limit for sustained periods of more than a minute or two, there is nothing for you except request a higher limit.

Within a fixed limit, the only thing you can do with your software that is tracking its output rate is to start queueing or start reporting “too busy” if too deep, and also handling the possible API errors you still get by running near the limit.

vb · July 22, 2023, 5:49am

For actual production you should switch to the OpenAI services offered via Microsoft Azure.

As far as I can tell OpenAI is not trying to be more than the provider of the models powering your app. Instead Microsoft handles the scale and Open AI develops and maintains the models.

Topic		Replies	Views
Hitting Rate Limit with small group of Users? API api-rate-increase	14	6557	January 20, 2024
Queries about generating multiple requests at a time on davinci model and increasing the token limit API	2	2925	December 22, 2023
Inquiry About Maximum Rate Limit for GPT-3.5-turbo-16k Model API api-rate-increase , rate-limit	7	1097	November 1, 2023
Upgrade Monthly Usage Limit API api-rate-increase	5	983	December 16, 2023
NeedHelp: Increase usage limit API api-rate-increase , api-billing	5	1274	December 16, 2023

How to handle expected large rate limits

Related topics