Hello everyone,
For the past year I’ve been building agents using Azure OpenAI gpt-4o 2024-11-20.
I wanted to upgrade to one of the model of the gpt-5 serie to gain performance notably in SWE and Instruction following. My main use cases are text-to-SQL, python coding and RAG and I require:
- Quick response: so very little to no reasoning
- Tool call
I found in the OpenAI documentation that gpt-5-chat-latest is the natural successor of gpt-4o:
GPT-5 System Card | OpenAI
| System card name | API alias |
|---|---|
gpt-5-thinking |
gpt-5 |
gpt-5-thinking-mini |
gpt-5-mini |
gpt-5-thinking-nano |
gpt-5-nano |
gpt-5-main |
gpt-5-chat-latest |
gpt-5-main-mini |
[not available via API] |
gpt-5-chat-latestdoesn’t support tool call in OpenAI.gpt-5-chatin Azure OpenAI does support tool call, but tends to not call them much and output everything in the chat (plus the model is in preview).gpt-5has a too long time to first token (~40s) for my use case even withreasoning_effort“low” andverbosity“low”
What are you thoughts on that?
