GPT-4 is faster again these days

jwatte · July 19, 2023, 11:04pm

Given all the complaints one or two months ago about gpt-4 inference speed, I thought I’d note that, lately, gpt-4 has been running faster on average than back then.
I don’t know if this is “different roll of the die” or “die-off in load” or “additional provisioned capacity” but, whatever it is, I’ll take it, and thank whoever needs thanking

kelmendival2021 · July 22, 2023, 3:58am

How fast it is? I mean how many seconds it takes to generate articles?

jwatte · July 22, 2023, 4:37am

Generation is directly proportional to number of tokens generated, because each iteration to generate a token is one evaluation through the model.

It’s still clearly slower than gpt-3.5-turbo, but the 40 seconds for a paragraph we used to see (probably queuing time?) are now largely gone.

Topic		Replies	Views
Is GPT-4 Getting Faster? (Latency more than halved in the last 3 months) API gpt-4 , gpt-35-turbo , api	0	649	October 17, 2023
GPT 4 API is Very Slow Still API gpt-4 , chatgpt , api	15	6633	December 16, 2023
Is gpt4 turbo preview now slower than gpt 4? API gpt-4 , gpt-4-turbo	3	8507	January 23, 2024
Anyone know when we can expect to see speed start to pick up for GPT-4? API	0	744	March 25, 2023
Gpt-4-0125-preview is slower than gpt-4-0613? Feedback gpt-4 , api	5	5503	January 30, 2024

GPT-4 is faster again these days

Related topics