Gpt-4-0125-preview INCREDIBLY slower than 3.5 turbo

logankilpatrick · February 19, 2024, 2:24pm

Hey, this is somewhat to be expected. The GPT-4 series models will always be slower than the 3.5T series models. Which model were you using before, if I am understanding you right, your saying the token generation time went from 1 min to 5 min?

Topic		Replies	Views
Gpt-4-0125-preview is slower than gpt-4-0613? Feedback gpt-4 , api	5	5550	January 30, 2024
GPT 4 API is Very Slow Still API gpt-4 , chatgpt , api	15	6720	December 16, 2023
GPT-3.5 API is very slow. Any fix? API	31	9866	October 12, 2023
GPT-3.5 Turbo API response is slow API	20	12355	November 11, 2023
GPT-3.5 API is 30x slower than ChatGPT equivalent prompt API gpt-35-turbo , api	69	13855	November 30, 2023

Gpt-4-0125-preview INCREDIBLY slower than 3.5 turbo

Related topics