We are currently working with the Open AI o4-mini-deep-research model using the Agents SDK as outlined in this example: Introduction to Deep Research API Agents. However, we are frequently encountering the following error:
Rate limit reached for o4-mini-deep-research in organization on tokens per min (TPM): Limit 200000, Used 177684, Requested 33606. Please try again in 3.387s.
To address this, we implemented an exponential backoff retry, but it hasn’t resolved the issue. Even when we retry the API call after a few minutes, we still face the same rate limit problem.
Here are some details:
We are currently on Tier-1.
We are open to upgrading to a higher tier, but I couldn’t find any documentation confirming that moving to a higher tier would increase the TPM limit for this specific model (currently set to 200,000 by OpenAI).
Additionally, we wanted to try implementing this with the o3-deep-research model, but I don’t see it listed under my OpenAI subscription. Currently, I can only see two deep research models available i.e. o4-mini-deep-research and o4-mini-deep-research-2025-06-26
Is o3-deep-research restricted to certain subscription tiers, or is there another reason it might not be accessible?
I’ve attached relevant screenshots for additional context. Has anyone faced similar issues or have insights on how to handle this?
Would upgrading the tier resolve the token-per-minute limit for o4-mini-deep-research, and how can we access the o3-deep-research model?
Any guidance would be greatly appreciated! Thank you.
hey there - may I ask how you managed to implement a backoff inside the agent SDK run function? I cant seem to figure out where to put it. Thanks in advance!
newest payment brings total paid to OpenAI over $50
o3 requires “ID verification”: scan your personal ID and video your face and send it off to a sketchy company, in your platform site “general” section. It seems that ID verify requirement is inherited by the o3 deep research version, also (which can be costly - some new changes should make it closer to $2 per call instead of $20.)
Using Deep Research will cost you a lot of money. It is expensive—a single use can be several dollars.
Let’s say that your first payment is older than two weeks, and you now pay in enough that your total payments to OpenAI are over $100, so you’ve reached tier 3, with TPM=800,000 (on a model that can cost you into the millions per call at $10/M in and $40/M out, because it can “research” for dozens and dozens of tool calls).
How many uses are you actually going to get for your balance you’ve paid in? Not a whole lot. Compare that to $200/month for ChatGPT Pro, which gives you 250/month…where you find you don’t actually have that much to “research” anymore after paying $2,400 in month 13.
The thing is I don’t have any costs racked up on my account right now because apart from deep research, I am only using very light models. And I can’t do any deep research calls because they error out due to rate limits.
Pro is not an interesting option for me as I am specifically looking for API solutions
Sorry for the confusion, took me a while to understand but I got it now. I am contacting support to refresh my usage tier based on the amount already in my wallet. Thanks for the help