GPT-3.5 API is very slow. Any fix?

_j · October 11, 2023, 10:51pm

Python test code to run (and slowness measure confirmed by another)

I’m still doing well, compare the “latency” of 1 token to a full response of 512:

Title
[1 tokens in 1.0s. 1.0 tps]
Title: Embracing Digital Transformation: Unlocking the Power of the Digital Age

[128 tokens in 1.9s. 67.6 tps]
Title: Embracing Digital Transformation: Unlocking the Power of the Digital Age

[512 tokens in 7.2s. 70.8 tps]

post-pay, Western US.

Unlike other reports of massive slowing in the last few days:

So this is not a “blame on intermittent stuff and the user”.

Although it does appear to be “sticky” to particular users. Reports of where you are geographically connected, whether you are prepay or billing or free trial, whether you ever paid a bill, etc. could help determine why some are affected and some are fast.

Topic		Replies	Views
Chat GPT's API is significantly slower than the website with GPT Plus API	35	36756	December 12, 2023
Gpt-4-0125-preview INCREDIBLY slower than 3.5 turbo API	12	9575	July 22, 2024
Let's compare our API speed? It's too slow! API gpt-4 , gpt-35 , gpt-35-turbo , api , playground	9	1912	October 29, 2023
GPT-3.5 API is 30x slower than ChatGPT equivalent prompt API gpt-35-turbo , api	69	13938	November 30, 2023
GPT-3.5 Turbo API response is slow API	20	12439	November 11, 2023

GPT-3.5 API is very slow. Any fix?

Related topics