Thanks for confirming. I looked into the Responses API, but unfortunately it doesn’t support n > 1, which my app needs for generating multiple options at once.
For now, I’m just dropping max_completion_tokens entirely from my chat completions call so it doesn’t break. It’s a bummer losing the hard cap on costs though, so hopefully the team fixes that parameter on the Chat Completions endpoint soon.
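In case it helps anyone hitting the same thing, here’s a minimal sketch of what my request params look like now — keeping n > 1 but leaving out max_completion_tokens. Parameter names follow the OpenAI Python SDK; the model string and the helper function are just placeholders:

```python
# Hypothetical sketch: build kwargs for client.chat.completions.create,
# keeping n > 1 but deliberately omitting max_completion_tokens.
def build_chat_kwargs(prompt: str, n_options: int = 3) -> dict:
    kwargs = {
        "model": "gpt-4o",  # placeholder model name
        "messages": [{"role": "user", "content": prompt}],
        "n": n_options,  # generate multiple candidate completions in one call
    }
    # No "max_completion_tokens" here: passing it was breaking the call,
    # so output length stays uncapped until the endpoint is patched.
    return kwargs

params = build_chat_kwargs("Suggest three taglines.")
# The dict would then be passed straight through, e.g.:
# client.chat.completions.create(**params)
```

The upside of building the kwargs in one place is that re-adding the token cap later is a one-line change.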