There is a delay related to the number of input tokens, although it is typically only a fraction of the total response time. I do not have access to the 32k model to build any data-based model of the time taken, but from the 4k and 8k context variants of GPT-3.5 and GPT-4 the increase seems small, on the order of a second or so when the context is near its maximum.
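
If you wanted to check this empirically, a rough sketch like the following would time completions against increasingly long prompts. This assumes the official OpenAI Python client; the model name, filler text, and the crude token estimate are just placeholders, not a proper benchmark.

```python
# Minimal sketch: measure how response latency scales with prompt size.
import time
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def time_completion(model: str, prompt: str) -> float:
    """Return wall-clock seconds for a single chat completion."""
    start = time.perf_counter()
    client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
        max_tokens=1,  # keep the output tiny so input processing dominates
    )
    return time.perf_counter() - start

# Grow the prompt toward the context limit and watch how latency changes.
filler = "lorem ipsum " * 10
for repeats in (1, 50, 200, 600):
    prompt = filler * repeats
    elapsed = time_completion("gpt-3.5-turbo", prompt)
    # len(prompt) // 4 is only a very rough token estimate
    print(f"~{len(prompt) // 4:>6} tokens (approx): {elapsed:.2f}s")
```

Averaging several runs per prompt length would help, since network and server-side variance can easily swamp a one-second difference.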