GPT-5 sometimes returns intermediate reasoning steps in final `output_text` blocks

I am using the OpenAI Responses API to generate text responses with gpt-5, minimal reasoning, and low verbosity. Rarely, when generating a response, the output_text will include what seems like an intermediate reasoning step, something like: "Ok, now I will check the account. Done. Checked the account. The account has a certain account type. How do I handle this account type? Ok, I must follow instructions for that account type. Ok, done.

Based on your account type, please […final response]"

It does return the final output text that should be sent, but it's prepended with some reasoning.

How I'm parsing:

First, messages is defined by taking all elements of response['output'] where the element has type == 'message' and role == 'assistant'.

Then, for each element of messages, I take each content item's text value where its type == 'output_text', and join them all together. I'm guessing it's something to do with joining them all together, but I've got no idea how I would then differentiate between 'final' output text and intermediate output text.
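A minimal sketch of the parsing described above, operating on a response already converted to a dict. The field names follow the Responses API output shape; the sample data in the usage note is invented.

```python
def collect_output_text(response: dict) -> str:
    """Join the text of every output_text part in every assistant message."""
    # Step 1: keep only assistant message items from the output array.
    messages = [
        item for item in response["output"]
        if item.get("type") == "message" and item.get("role") == "assistant"
    ]
    # Step 2: collect every output_text part and join them.
    parts = []
    for message in messages:
        for content in message.get("content", []):
            if content.get("type") == "output_text":
                parts.append(content["text"])
    return "".join(parts)
```

Note that if the output array carries two message items, both get joined, which is exactly how an intermediate step ends up prepended to the final answer.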

I am on ZLR, and I'm finding this issue impossible to reproduce. Is this a prompt issue, or is there an issue with my parsing? It works correctly almost always, but rarely it will return what seems like a non-final response, as above. I haven't been able to capture a request that causes this, which is driving me nuts. I'll add more logging on my end to see if I can capture one, but in the meantime I'm wondering if anyone else has run into this issue.

Hi,
We are facing the same issue. Did you find a workaround for this?
We've figured out that whenever it happens, the returned reasoning is empty. We're thinking of adding a retry on empty reasoning.
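A rough sketch of that retry idea: re-issue the request whenever the response carries no non-empty reasoning item. `create_response` here is a stand-in for whatever wraps the actual API call, not a real SDK function, and the heuristic assumes the empty-reasoning correlation above holds.

```python
def respond_with_retry(create_response, max_attempts: int = 3) -> dict:
    """Retry the call while the response contains no non-empty reasoning item."""
    response = create_response()
    for _ in range(max_attempts - 1):
        has_reasoning = any(
            item.get("type") == "reasoning" and item.get("summary")
            for item in response["output"]
        )
        if has_reasoning:
            break
        response = create_response()  # empty reasoning observed; try again
    return response
```

One caveat to verify: with minimal reasoning, an empty reasoning summary may sometimes be legitimate, so this could retry on perfectly good responses.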

I think you are not parsing the output correctly.

If output[i].type == "message", then take output[i].content[0].text.

I think you are using a poor AI model whose name starts with "gpt-5", one with no decent controls over the sampling done (to be even cheaper and save math?), and which thus cannot reliably signal the "channels" in which to write its "commentary" and "analysis" with unseen reasoning, versus "final".

The AI writes the final channel and then proceeds to its post-trained thinking-style output anyway.

The only thing to be done is to address directly, in your developer message, what the AI is too poorly trained to know and do, to reinforce where "unseen planning" goes, and to try to improve the logits. You can read more about the "harmony" ChatML replacement, which has trickles of information to inform your communication style.

Then escalate the apparent authority of the "developer" role.

You’d have to write a “developer” book in order to cover every edge case of systemic bad model behavior, though.

@satwik_dai Are you on zero log retention?

We are parsing correctly.

The API will return two message-type items in the output array (in varying order, so we can't just pick one), each with content[0] populated: one will be the 'real' output and the other an intermediate reasoning step.

No, I don't think so. We didn't explicitly disable this.

We are using the Responses API statelessly in accordance with zero data retention. We have not seen the error again since enabling encrypted reasoning items (won't let me post a link, but it's in the docs), but I can't say for sure that it resolved the issue until we have a sufficient sample size. I'll follow up if we still see it, and perhaps attempt another solution like the one you proposed above.


In a self-managed conversation ("store": false), passing back the encrypted reasoning for placement may act a bit more like multi-shot prompting, reinforcing to the AI "how to write" its responses in general. Good discovery.

To receive it along with the summary in the "output", with some booleans to turn it off, in Python:

"include": [
    *(
        ["reasoning.encrypted_content"]
        if is_reasoning and include_encrypted_content
        else []
    ),
    # .. other include types not demonstrated
],
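And a sketch of the stateless turn itself, carrying the previous response's encrypted reasoning items forward. The helper below is hypothetical; the commented `client.responses.create` calls follow the OpenAI Python SDK, and exact parameter support should be verified against the docs for your model.

```python
def build_followup_input(prior_output: list, user_text: str) -> list:
    """Chain the full prior output (message items and encrypted
    reasoning items alike) into the next request's input."""
    return list(prior_output) + [{"role": "user", "content": user_text}]

# Intended use (not executed here):
#
#   first = client.responses.create(
#       model="gpt-5",
#       store=False,
#       include=["reasoning.encrypted_content"],
#       input=[{"role": "user", "content": "Check my account."}],
#   )
#   second = client.responses.create(
#       model="gpt-5",
#       store=False,
#       include=["reasoning.encrypted_content"],
#       input=build_followup_input(first.output, "And my other account?"),
#   )
```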

Also got this issue. In the output array, the reasoning text seems to be marked as a message, so I can't differentiate between them.
If the reasoning tokens are encrypted, how do we output the reasoning content?

Curious how you guys want to output the reasoning text?