Who is gpt-4o agent successor in the gpt-5 serie?

LambertB · October 10, 2025, 8:06am

Hello everyone,

For the past year I’ve been building agents using Azure OpenAI gpt-4o 2024-11-20.

I wanted to upgrade to one of the model of the gpt-5 serie to gain performance notably in SWE and Instruction following. My main use cases are text-to-SQL, python coding and RAG and I require:

Quick response: so very little to no reasoning
Tool call

I found in the OpenAI documentation that gpt-5-chat-latest is the natural successor of gpt-4o:
GPT-5 System Card | OpenAI

Using GPT-5 - OpenAI API

System card name	API alias
`gpt-5-thinking`	`gpt-5`
`gpt-5-thinking-mini`	`gpt-5-mini`
`gpt-5-thinking-nano`	`gpt-5-nano`
`gpt-5-main`	`gpt-5-chat-latest`
`gpt-5-main-mini`	[not available via API]

gpt-5-chat-latest doesn’t support tool call in OpenAI.
gpt-5-chat in Azure OpenAI does support tool call, but tends to not call them much and output everything in the chat (plus the model is in preview).
gpt-5 has a too long time to first token (~40s) for my use case even with reasoning_effort “low” and verbosity “low”

What are you thoughts on that?

sps · October 10, 2025, 8:08am

_j · October 10, 2025, 8:55am

Try o4-mini.

It reasons, but reasons fast, then a rapid token spew across your screen.

It has better-quality understanding than gpt-5, and then you just have to argue about style.

Where’s our o4??

Topic		Replies	Views
Which model is best for speed and accuracy? API gpt-35-turbo , api , python , gpt-4o	8	25633	February 26, 2025
Successor to 4o mini? When? API	9	847	April 5, 2025
Which OpenAI model is the most code oriented? API code	5	11381	July 3, 2025
GPT 4 models vs GPT 5 models API	2	451	November 24, 2025
Serious latency issues while migrating from GPT-4o API	4	169	July 24, 2026

Who is gpt-4o agent successor in the gpt-5 serie?

Related topics