Best practices for handling long queue times with OpenAI's Responses API

Hi everyone,

When using OpenAI’s Responses API with deep research models (like o3-deep-research and o3-pro), long queue times can be an issue. I’ve seen our queue times exceed 45 minutes.

What’s the recommended approach if a job seems stuck in a queue?
If a job hasn’t completed after 45 minutes, is it advisable to abandon it and resubmit an identical job (hoping for a shorter queue), or is there a better best practice for handling this?

I realize that abandoning a job doesn’t cancel it, so there’s a risk of being charged for multiple completions if both eventually finish. Is there any API-level method for canceling a pending/running job to prevent this?

What strategies do others use to manage long queue times other than simply waiting? Any advice on avoiding duplicate charges would be much appreciated.

Thanks for your insights!

There are various options, the simplest being background mode or the Batch API.
For a more sophisticated approach, you can use webhooks.

A background request can be streamed, polled, or set up to trigger a webhook notification when done. It can also be cancelled.
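To tie the polling and cancellation together, here is a minimal sketch of a "poll with a deadline, cancel on timeout" loop. It assumes the OpenAI Python SDK's Responses API shape (`client.responses.create(..., background=True)` to submit, `client.responses.retrieve(id)` to poll, `client.responses.cancel(id)` to cancel); the function name and timeout values are my own illustration, not an official recipe.

```python
import time


def run_with_timeout(client, response_id, timeout_s=45 * 60, poll_s=30):
    """Poll a background response until it finishes; cancel it on timeout.

    Cancelling server-side (rather than abandoning the job and resubmitting
    an identical one) means only one job can ever bill you.
    """
    deadline = time.monotonic() + timeout_s
    while time.monotonic() < deadline:
        resp = client.responses.retrieve(response_id)
        # Stop on any terminal status for a background response.
        if resp.status in ("completed", "failed", "cancelled", "incomplete"):
            return resp
        time.sleep(poll_s)
    # Timed out: explicitly cancel the pending/running job instead of
    # leaving it queued, so a resubmission can't produce a duplicate charge.
    return client.responses.cancel(response_id)
```

If the job finishes in time you get the completed response; otherwise you get back the cancelled one, and only then is it safe to resubmit.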


On that doc page about background requests, it also says that terminating the connection will cancel a synchronous request. But doing so will still charge you. I was hoping I would not be charged, at least for the output tokens.

My question is: are we charged for cancelled background requests? I assume at least for the input tokens, but what about the output tokens?
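One empirical way to answer this is to retrieve the cancelled response and inspect its `usage` field, which on Responses API objects reports `input_tokens` and `output_tokens`. A small hedged helper (my own sketch, assuming that usage shape):

```python
def billed_tokens(resp):
    """Return (input_tokens, output_tokens) reported on a response object.

    Assumes the usage shape of the Responses API; returns (0, 0) when the
    server reported no usage at all (the hoped-for case for a cancelled job).
    """
    usage = getattr(resp, "usage", None)
    if usage is None:
        return (0, 0)
    return (getattr(usage, "input_tokens", 0) or 0,
            getattr(usage, "output_tokens", 0) or 0)
```

Calling this on a response you cancelled mid-queue versus one cancelled mid-generation would show whether output tokens were actually metered before the cancel took effect.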
