Scaling OpenAI API for production use

Hi Everyone!

I am building a product for my company that relies heavily on the OpenAI APIs for data summarisation and formatting.
I extract the raw text from my files (PRFs) and then use that text to call the chat completion API to summarise and format the data.

Problem - the OpenAI APIs are not reliable enough to use in production: calls sometimes fail, and sometimes the model refuses to generate an answer, even though I am well within the token and API rate limits.

Solution that I came up with - use multiple OpenAI accounts (let’s say 3) and internally route the requests between them.

Questions for the community -

  1. Does this approach work, or should I try a different one?
  2. Will OpenAI block me for this?
  3. Are there any better suggestions for making the OpenAI APIs more reliable in production?

Thank you.


NO! That’s a terrible idea.

This is not a new problem. It is a standard engineering challenge whenever you adopt an external API whose availability you don’t control.

I have two websites in production that rely heavily on the OpenAI APIs, as well as a widely adopted chatbot and an open-source summarisation plugin for Discourse.

The solution is to turn your calls into batch jobs that automatically retry until a successful response is returned.

I use Sidekiq for this, which is excellent.
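To make this concrete, here is a minimal sketch of what such a job can look like. The job name, the Document model and the OpenAiClient wrapper are placeholders for whatever your own code looks like:

```ruby
# Minimal Sidekiq job sketch. SummariseDocumentJob, Document and OpenAiClient
# are placeholder names - swap in your own classes.
class SummariseDocumentJob
  include Sidekiq::Job

  # Retry up to 10 times before the job lands in the dead set.
  sidekiq_options queue: "openai", retry: 10

  def perform(document_id)
    document = Document.find(document_id)                 # your persistence layer
    summary  = OpenAiClient.summarise(document.raw_text)  # raises on API failure
    document.update!(summary: summary)
    # Any exception raised above makes Sidekiq re-enqueue the job automatically,
    # with an increasing delay between attempts.
  end
end
```

You enqueue it with `SummariseDocumentJob.perform_async(document.id)`, so the web request returns immediately and the summarisation (including any retries) happens in the background.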


Hey @merefield, thanks for your suggestion. Batch jobs sound like a good idea.

Thank you.


Great. It’s a bit of extra work, but it will save you a lot of stress and micromanagement in production.

Note that the standard retry delay usually grows exponentially (exponential backoff), which is a really nice property because it copes with availability problems of very different durations without hammering the API.
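If you want explicit control over that schedule, Sidekiq lets you override the delay per job class. Continuing the placeholder job sketched above, something like this gives a simple exponential backoff (the exact timings are up to you):

```ruby
class SummariseDocumentJob
  include Sidekiq::Job

  # Override Sidekiq's default retry schedule with an explicit exponential backoff:
  # roughly 30s, 60s, 120s, 240s, ... between attempts.
  sidekiq_retry_in do |count, _exception|
    (2**count) * 30
  end
end
```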

Best of luck with the project!
