GPT-5 reasoning time vs response time

IAmJackHarper · August 9, 2025, 3:55pm

I was trying to figure out what makes GPT-5 so slow and in playground I found that most of the time used is not reasoning but waiting for reasoning to start.
For example
effort: low, verbosity: low, summary: auto
input: 127t
output: 3594t (reasoning 3584t + text 10t )
”Thought for 32 seconds”
Total time 1m 24s
Most of the time is waiting with the 3 dots indicator, what is the model doing in that time? Is it a queue due to high use of new release or is it expected?

_j · August 9, 2025, 9:18pm

You are looking at the time-to-first chunk when you see UI interactivity.

The AI model has to produce some reasoning, enough to talk about.

Then the summarizer that prevents you seeing the true model generation has to abstract that away, an AI generating new language for a progress indication.

There can be additional delays, notably, setting up a context-free grammar when calling a new strict function or structured output response.

The Responses API is an interloper, it and Assistants indeed can behave like a fifo during busy times.

IAmJackHarper · August 10, 2025, 8:46am

I’m wasn’t using functions or structured outputs. As of now, two days since launch, I found it too weak on “minimal” and too slow on “low” for most applications. Didn’t dare to try medium or high.

ny08 · August 10, 2025, 10:07am

I find it very slow. It’s a dissappointing update. I also don’t find the output that useful.

Topic		Replies	Views
GPT 4 API is Very Slow Still API gpt-4 , chatgpt , api	15	6867	December 16, 2023
Api response time too long API	2	3551	November 2, 2023
Long response time API	9	1176	December 15, 2023
Chat API is slow!, Fix it! API gpt-35-turbo , chatgpt , api	6	2728	December 24, 2023
Is there an issue with GPT 3.5 turbo 16k? API	5	951	October 27, 2023

GPT-5 reasoning time vs response time

Related topics