OpenAI's only real solution to slow token generation speed is to move the customer-facing AI to gpt-3.5-turbo.
You can see the recent improvement in GPT-4's completion time for 250 tokens (top blue line), which corresponds almost exactly with the drop in "GPT-4 no longer making long outputs" complaints as load decreased.