Hello everyone, I have a question regarding the response of o4-mini. We tried prompting in Azure AI foundry playground, and we are using o4-mini. What I have noticed is even with simple questions like “What is the difference between power and authority”. The respond will took 2 minutes and it is just the chain of thoughts and not a complete response. Is there anything that i can do to make it respond faster? Thanks
O4-mini has a parameter ‘reasoning_effort’, where you can trade the expenditure on reasoning generation to speed. The performance will be related to your particular model datacenter deployment of OpenAI services on Azure, something where this forum is not going to be a primary support source.
The endpoint returns a list of response items, so if you only get the first one parsed out, especially using the SDK’s helper response.output
and just getting index:0, you are going to receive a reasoning summary and not content.
2 Likes