Why are requests in batches so much less effective than single API calls?

george-p · December 31, 2024, 8:02pm

We have a lot of different processes we run on old property documents, and all use pretty demanding prompts.

When processing these via API calls, the vast majority are successful, however, when processing the exact same requests in a batch, we get a failure rate ranging from 10% to 70%.

We have consistently experienced this across many different prompts and process types.

The requests used on both, the API calls and in batches, are EXACTLY the same and we’re using the ‘gpt-4o-2024-08-06’ model.

Can anyone please lend some insight on this and what a solution may be?

Thank you.

Topic	Replies	Views
Batch processing gets different results than individual requests API batch-api	160	November 27, 2024
Issues with Rate Limiting and Batch Processing in OpenAI API Community api , batching	1887	November 11, 2023
Processing 4.5K GPT Requests - Batching Prompts API batch	63	December 10, 2024
Batch API instability - randomly failing on 403 Bugs batch-api	100	December 24, 2024
Batch API Jobs Expiring Prematurely Using GPT-4o API gpt-4o , batch-api	167	February 3, 2025

Why are requests in batches so much less effective than single API calls?

Related topics