Azure OpenAI o4-mini slow respond

Jerrick_James_Decena · May 15, 2025, 2:03am

Hello everyone, I have a question regarding the response of o4-mini. We tried prompting in Azure AI foundry playground, and we are using o4-mini. What I have noticed is even with simple questions like “What is the difference between power and authority”. The respond will took 2 minutes and it is just the chain of thoughts and not a complete response. Is there anything that i can do to make it respond faster? Thanks

_j · May 15, 2025, 2:49am

O4-mini has a parameter ‘reasoning_effort’, where you can trade the expenditure on reasoning generation to speed. The performance will be related to your particular model datacenter deployment of OpenAI services on Azure, something where this forum is not going to be a primary support source.

The endpoint returns a list of response items, so if you only get the first one parsed out, especially using the SDK’s helper response.output and just getting index:0, you are going to receive a reasoning summary and not content.

Topic		Replies	Views
How to control o3-mini chat model without returning "One moment please" API chatgpt , azure-openai , o3-mini	3	600	March 11, 2025
Openai api so slow recently Feedback api	0	229	February 27, 2025
How can I improve response times from the OpenAI API while generating responses based on our knowledge base? API chatgpt , api	3	22863	November 9, 2023
How to reduce OpenAI response time? API	13	17794	December 13, 2023
Gpt-4o-mini is really slow API gpt-4o-mini	6	2770	March 18, 2025

Azure OpenAI o4-mini slow respond

Related topics