Background mode shouldn't delay responses, unless using flex

pachocastillosr · July 11, 2025, 2:11am

Hello,

I’ve tested the Responses API with background: true to offload long-running, high-reasoning tasks (3–7 min) without tying up client memory, but the results were disappointing.

Jobs that took 3–5 minutes via the Completions API sit in a queue in background mode and only start after ~18 minutes. This is with the default service_tier (full price), not flex mode (a lower price per token in exchange for delayed responses).
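
To put numbers on the queue delay described above, here is a minimal sketch of a polling helper that separates time spent queued from total time. The names `measure_queue_delay` and `get_status` are hypothetical, and the status strings ("queued", "in_progress", "completed", and so on) are assumed rather than taken from this thread. With the official Python SDK, `get_status` could plausibly be something like `lambda: client.responses.retrieve(response_id).status`, but treat that wiring as an assumption too.

```python
import time
from typing import Callable, Dict, Union

# Assumed terminal statuses for a background response (not confirmed in this thread).
TERMINAL = {"completed", "failed", "cancelled", "incomplete"}


def measure_queue_delay(
    get_status: Callable[[], str],
    poll_interval: float = 2.0,
    timeout: float = 3600.0,
    clock: Callable[[], float] = time.monotonic,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Union[str, float]]:
    """Poll get_status() until a terminal status is seen.

    Returns the final status, the seconds spent in "queued" before the
    first non-queued status was observed, and the total elapsed seconds.
    Resolution is limited by poll_interval.
    """
    start = clock()
    left_queue_at = None
    while True:
        status = get_status()
        now = clock()
        # Record the first moment the job is observed outside the queue.
        if status != "queued" and left_queue_at is None:
            left_queue_at = now
        if status in TERMINAL:
            return {
                "status": status,
                "queued_s": left_queue_at - start,
                "total_s": now - start,
            }
        if now - start > timeout:
            raise TimeoutError(f"still {status!r} after {timeout} seconds")
        sleep(poll_interval)
```

Comparing `queued_s` with `total_s` across a few runs would show whether the extra wall-clock time comes from waiting in the queue or from the generation itself.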

In my opinion, this queueing greatly reduces the value of background mode. Background jobs should start immediately by default and queue only when using the lower-cost flex tier. We shouldn’t have to pay for the priority tier (which is only available to enterprise customers) to get this.

OpenAI, please consider this.

Thank you.

Adam_Allcock_Rokt · August 25, 2025, 1:22am

I’m also experiencing this. Is the only workaround to use priority mode?
