The OpenAI Batch API is not working for me at all. It used to work fine with the same code, but it has now been down for three days.
I thought there might be an issue with my code, so I tried uploading previously validated data directly on the OpenAI platform website, but that isn't working either.
After waiting 24 hours, I get an “Expired” error with the message: "error": {"code": "batch_expired", "message": "This request could not be executed before the completion window expired."}.
I requested assistance through support messages, but they haven't responded. A few other people seem to be experiencing similar issues, and there's no solution in sight. What should I do?
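For reference, this is roughly how I check where a stuck batch ended up, using the openai Python SDK (the batch ID below is just a placeholder):

from openai import OpenAI

client = OpenAI()

# "batch_abc123" is a placeholder; substitute the ID of the stuck batch.
batch = client.batches.retrieve("batch_abc123")
print(batch.status)          # comes back as "expired" in my case
print(batch.request_counts)  # how many requests completed vs. failed before expiry
if batch.error_file_id:
    # Per-request errors, if any, are written to a separate error file.
    print(client.files.content(batch.error_file_id).text)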
After running multiple tests, I've found that this issue only occurs with the “gpt-4o-mini” model; the more expensive models work fine. However, my business is still in its early stages and I can't afford the pricier models, so operations are currently on hold.
Same for us. gpt-4o-mini batches are not working at all. We tried a test batch with gpt-3.5-turbo-0125 and that processed fine, so it does indeed seem to be model-specific.
All I see on the incidents page is a mention of “Increased latency in the API on GPT-4o mini” which was apparently resolved on 17th October.
It's disappointing when we have valuable business processes dependent on this. Because of this (plus a previous batch taking over 24 hours and being cancelled), we are now starting to investigate Gemini as an alternative or backup; they offer a batch service through their API.
The docs say that batches that do not complete in time eventually move to an expired state, which seems to imply there is no guarantee that a batch will complete. The only guarantee given is that your batched items will be held in the queue for a total of 24 hours.
With any LLM batch system you are, in some sense, gambling on a window of low demand opening up for that sweet 50% discount. If no such window appears within the 24-hour period, you have to try again.
If batching is a big part of your process, it may be useful to submit single-item probe batches throughout the day to get a rough sense of current turnaround times (see the sketch below).
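Something like this, using the openai Python SDK (the filename, prompt, and poll interval are all arbitrary choices):

import json
import time

from openai import OpenAI

client = OpenAI()

# A single tiny request, written out in the Batch API's JSONL format.
probe = {
    "custom_id": "probe-1",
    "method": "POST",
    "url": "/v1/chat/completions",
    "body": {
        "model": "gpt-4o-mini",
        "messages": [{"role": "user", "content": "Reply with OK."}],
        "max_tokens": 5,
    },
}
with open("probe.jsonl", "w") as f:
    f.write(json.dumps(probe) + "\n")

# Upload the file and create a batch against it.
input_file = client.files.create(file=open("probe.jsonl", "rb"), purpose="batch")
batch = client.batches.create(
    input_file_id=input_file.id,
    endpoint="/v1/chat/completions",
    completion_window="24h",
)

# Poll until the batch reaches a terminal state and log the turnaround time.
start = time.time()
while batch.status in ("validating", "in_progress", "finalizing"):
    time.sleep(60)
    batch = client.batches.retrieve(batch.id)
print(f"probe ended as {batch.status!r} after {time.time() - start:.0f}s")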
Hey folks, I ran into this issue as well, but then I realized it is still working, just only for requests with short prompts.
A batch like the following completes within a couple of minutes:
{"custom_id": "request-1", "method": "POST", "url": "/v1/chat/completions", "body": {"model": "gpt-4o-mini", "messages": [{"role": "system", "content": "You are a helpful assistant."},{"role": "user", "content": "What's the capital of Japan?"}],"max_tokens": 1000}}
{"custom_id": "request-2", "method": "POST", "url": "/v1/chat/completions", "body": {"model": "gpt-4o-mini", "messages": [{"role": "system", "content": "You are a sarcastic assistant."},{"role": "user", "content": "What's the capital of US"}],"max_tokens": 1000}}
However, most of the prompts I'm working with right now are much longer than the ones in the example above (and my batches tend to have 100-200 entries as well).
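In case it helps anyone reproduce this, submitting a JSONL file like the one above is just a file upload plus a batch creation with the openai Python SDK (the filename here is whatever you saved the request lines as):

from openai import OpenAI

client = OpenAI()

# "requests.jsonl" holds the request lines shown above, one JSON object per line.
input_file = client.files.create(file=open("requests.jsonl", "rb"), purpose="batch")
batch = client.batches.create(
    input_file_id=input_file.id,
    endpoint="/v1/chat/completions",
    completion_window="24h",
)
print(batch.id, batch.status)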
Maybe I'm spoiled in North America, but I send my batches in the evenings and on weekends, usually after 8pm on weekdays, US west coast. I've never had a job take even one hour to complete; most are 10-20 minutes tops. My batches are usually 500-5000 calls each, so perhaps I'm not hitting it as hard.
One thing I will recommend: always use full model strings with the Batches API. I've encountered some model-string bugs recently, and you want the right models completing your calls. Although, given that there is only one snapshot of 4o-mini, that one shouldn't cause problems, theoretically…
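For what it's worth, pinning the snapshot in the earlier example would look like this (gpt-4o-mini-2024-07-18 being, as far as I know, the only 4o-mini snapshot):
{"custom_id": "request-1", "method": "POST", "url": "/v1/chat/completions", "body": {"model": "gpt-4o-mini-2024-07-18", "messages": [{"role": "system", "content": "You are a helpful assistant."},{"role": "user", "content": "What's the capital of Japan?"}],"max_tokens": 1000}}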
You’re absolutely right—I tried your requests, and they finished within a few minutes. I’ve already reached out to support, but I haven’t received any help yet. It’s reassuring to know that shorter requests still work, but I really need a solution for longer ones. Everything was working fine just a week ago.