Incomplete API responses due to "max_output_tokens" limit during batch processing

I’m experiencing an issue with the responses API where outputs come back with "status": "incomplete" and "reason": "max_output_tokens", even though my max_output_tokens is explicitly set to 25000, following OpenAI’s recommendation.

Interestingly, this issue does not occur when using the completions endpoint, even when max_tokens is only 1024.

Batch body args (sketched as JSON after this list):

  • Model: gpt-4o-mini-2024-07-18
  • Endpoint with issue: responses API (batch)
  • max_output_tokens: 25000
  • temperature: 0.5
  • Error status: "status": "incomplete", "reason": "max_output_tokens"
  • background: false (i.e., this is a blocking request)
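
For concreteness, here is a minimal Python sketch of that request body assembled as JSON. The input text is a placeholder, not from my actual job; the other values mirror the args listed above.

```python
import json

# Values mirror the args listed above; the input text is a placeholder.
body = {
    "model": "gpt-4o-mini-2024-07-18",
    "input": "Your prompt here",  # placeholder
    "max_output_tokens": 25000,
    "temperature": 0.5,
    "background": False,  # i.e., a blocking request
}
print(json.dumps(body, indent=2))
```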

Question:

Why is the responses API prematurely terminating even with such a high token limit?


Let’s investigate the model: gpt-4o-mini has a maximum of 16,384 output tokens.

Since 25000 exceeds that cap, the API calls should be failing on you.

A better message (like the ones the rate limiter and API validator send back for a normal call) would be helpful as a batch return.

The batch API should not be running the calls at all; the endpoint should be returning an error.
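
If that cap is the cause, a client-side workaround is to clamp the requested budget to the model’s documented maximum before building the batch. A minimal sketch (the cap table is my assumption, keyed to this one model):

```python
# Output caps per model; 16,384 for gpt-4o-mini comes from the model docs.
MODEL_OUTPUT_CAPS = {"gpt-4o-mini-2024-07-18": 16_384}

def clamp_max_output_tokens(model: str, requested: int) -> int:
    """Clamp a requested output budget to the model's hard cap, if known."""
    cap = MODEL_OUTPUT_CAPS.get(model)
    return min(requested, cap) if cap is not None else requested

print(clamp_max_output_tokens("gpt-4o-mini-2024-07-18", 25000))  # -> 16384
```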

BTW, there is no “set globally”. You have to construct individual complete API calls as JSON lines, each with their own parameters. I’ll assume it is just a miscommunication, and you are doing that.
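
To illustrate, a minimal sketch of writing such a file, where every line carries its own complete body (the prompts, ids, and the 16000-token budget here are placeholders):

```python
import json

prompts = ["First prompt", "Second prompt"]  # placeholders

with open("batch_input.jsonl", "w") as f:
    for i, prompt in enumerate(prompts):
        # Every line is a complete, self-contained request; there is no
        # file-level default that applies across lines.
        request = {
            "custom_id": f"req-{i}",
            "method": "POST",
            "url": "/v1/responses",
            "body": {
                "model": "gpt-4o-mini-2024-07-18",
                "input": prompt,
                "max_output_tokens": 16000,  # set per request
            },
        }
        f.write(json.dumps(request) + "\n")
```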

I had been using the responses endpoint for my batch jobs, but this issue started occurring recently, likely after the GPT-5 release. I’ve now switched back to the completions endpoint, and it works fine without any errors.

In responses, failed requests report the reason clearly in the response. In completions, by contrast, even if something goes wrong (such as truncation), the request is marked as completed and no error is surfaced.
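
One way to catch that silently on completions is to check finish_reason yourself. A minimal sketch, assuming the standard chat completions response shape (the parsed result here is an illustrative placeholder):

```python
import json

# Illustrative chat completions result, shaped like a line from the batch
# output file (content is a placeholder).
result = json.loads(
    '{"choices": [{"finish_reason": "length",'
    ' "message": {"content": "truncated text..."}}]}'
)

for choice in result["choices"]:
    # finish_reason "length" means the output hit max_tokens, even though
    # the request itself is reported as completed rather than failed.
    if choice["finish_reason"] == "length":
        print("Warning: output was truncated at max_tokens")
```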

I am also receiving this error. I pay $200 per month and I can’t even get a single API call to succeed. It always says max_output_tokens.

You can request a refund at support@openai.com

Try the completions endpoint. It does not fail as much as responses.