Facing Rate Limit Issues with o4-mini-deep-research Model

We are currently working with the OpenAI o4-mini-deep-research model using the Agents SDK, as outlined in this example: Introduction to Deep Research API Agents. However, we are frequently encountering the following error:

Rate limit reached for o4-mini-deep-research in organization on tokens per min (TPM): Limit 200000, Used 177684, Requested 33606. Please try again in 3.387s.
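
For reference, the numbers in that message are self-consistent: Used + Requested is 211,290 tokens, which is 11,290 over the 200,000 limit, and at 200,000 tokens per minute that overshoot takes about 3.387 seconds to clear. The sketch below reproduces the arithmetic. The function names are hypothetical, and the assumption that usage drains linearly over the minute is only inferred from the numbers matching, not taken from OpenAI documentation.

```python
import re

# The error text exactly as quoted above.
ERROR_MESSAGE = (
    "Rate limit reached for o4-mini-deep-research in organization on "
    "tokens per min (TPM): Limit 200000, Used 177684, Requested 33606. "
    "Please try again in 3.387s."
)


def parse_tpm_error(message):
    """Extract (limit, used, requested) token counts from the error text."""
    match = re.search(r"Limit (\d+), Used (\d+), Requested (\d+)", message)
    if match is None:
        raise ValueError("message does not look like a TPM rate limit error")
    limit, used, requested = (int(group) for group in match.groups())
    return limit, used, requested


def seconds_until_fit(limit, used, requested):
    """Estimate the wait until the request fits within the per-minute budget.

    Assumption: the used-token count drains at a steady limit/60 tokens
    per second. This is inferred from the error message, not documented.
    """
    overshoot = used + requested - limit
    if overshoot <= 0:
        return 0.0
    return overshoot * 60.0 / limit


limit, used, requested = parse_tpm_error(ERROR_MESSAGE)
wait = seconds_until_fit(limit, used, requested)
print(f"overshoot={used + requested - limit} tokens, wait={wait:.3f}s")
```

Note that this single request asks for 33,606 tokens, roughly 17% of the whole per-minute budget, so only a handful of calls of this size fit into any one minute at the current limit.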

To address this, we implemented an exponential backoff retry, but it hasn’t resolved the issue. Even when we retry the API call after a few minutes, we still face the same rate limit problem.
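
For context, the retry logic is roughly along these lines. This is a minimal sketch, not our exact code: `RateLimited` is a hypothetical stand-in for whatever exception the SDK raises on a 429 (with the `openai` Python package that would likely be `openai.RateLimitError`), and `call_with_backoff` and its parameters are names made up for illustration.

```python
import random
import time


class RateLimited(Exception):
    """Hypothetical stand-in for the SDK's rate limit exception."""


def call_with_backoff(fn, *, retry_on=RateLimited, max_attempts=6,
                      base_delay=1.0, max_delay=60.0, sleep=time.sleep):
    """Call fn(), retrying on retry_on with exponential backoff and jitter.

    The delay ceiling doubles on each failed attempt (1s, 2s, 4s, ...),
    capped at max_delay. The actual sleep is a random value between 0 and
    that ceiling, so parallel workers do not all retry at the same moment.
    """
    for attempt in range(max_attempts):
        try:
            return fn()
        except retry_on:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the last error
            ceiling = min(max_delay, base_delay * 2 ** attempt)
            sleep(random.uniform(0, ceiling))


# Usage example with a fake call that is rate limited twice, then succeeds.
attempts = {"count": 0}


def flaky_call():
    attempts["count"] += 1
    if attempts["count"] < 3:
        raise RateLimited("simulated 429")
    return "ok"


recorded_sleeps = []
result = call_with_backoff(flaky_call, sleep=recorded_sleeps.append)
print(result, len(recorded_sleeps))
```

Backoff like this only spaces out individual requests. It does not reduce the number of tokens each request asks for, which may be why it is not enough on its own here.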
Here are some details:

  • We are currently on Tier-1.
  • We are open to upgrading to a higher tier, but I couldn’t find any documentation confirming that moving to a higher tier would increase the TPM limit for this specific model (currently set to 200,000 by OpenAI).

Additionally, we wanted to try implementing this with the o3-deep-research model, but I don’t see it listed under my OpenAI subscription. Currently, I can only see two deep research models available: o4-mini-deep-research and o4-mini-deep-research-2025-06-26.

Is o3-deep-research restricted to certain subscription tiers, or is there another reason it might not be accessible?

I’ve attached relevant screenshots for additional context. Has anyone faced similar issues, or does anyone have insights on how to handle this?

Would upgrading the tier resolve the token-per-minute limit for o4-mini-deep-research, and how can we access the o3-deep-research model?

Any guidance would be greatly appreciated! Thank you.