My requests are getting throttled even though I’m not near my limits
I’m at tier 4, so I have 2M TPM and 10K RPM.
I ran some tests where:
- Each request uses 7671 tokens
- Average request execution time is 6.71 seconds
- Using the gpt-4o API
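For context, here is a quick back-of-the-envelope calculation from the numbers above, showing what the tier 4 TPM limit should allow at this request size and latency (the variable names are just mine):

```python
# Ceilings implied by the test parameters above.
TPM_LIMIT = 2_000_000        # tier 4 tokens/min
TOKENS_PER_REQUEST = 7671    # measured per request
AVG_SECONDS = 6.71           # average request execution time

# Requests per minute the TPM limit alone allows:
rpm_from_tpm = TPM_LIMIT / TOKENS_PER_REQUEST          # ~260 requests/min

# Concurrency needed to sustain that rate at ~6.71 s per request:
concurrency_needed = rpm_from_tpm * AVG_SECONDS / 60   # ~29 concurrent

print(f"TPM limit supports ~{rpm_from_tpm:.0f} requests/min")
print(f"saturating it needs ~{concurrency_needed:.0f} concurrent requests")
```

So the account limits should comfortably cover the concurrency levels in these tests; nothing in the math explains a ceiling near 338K TPM.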
I started off small, running only 10 requests @ 5 concurrent.
The best performance was a batch of 60 requests @ 20 concurrent requests.
At that mark I was doing about 500K TPM.
After that, each batch was 200 requests. I found that after about 60 to 70 requests, things slowed down significantly.
No matter how I configured it, I could not process more than 338K TPM.
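In case it helps anyone reproduce this, here is a minimal sketch of the kind of harness I mean: N requests bounded by a concurrency cap, with TPM computed from wall-clock time. The `simulate_request` coroutine stands in for the real API call (swap in your own client); the token count and a scaled-down latency are hard-coded from my measurements above.

```python
import asyncio
import time

TOKENS_PER_REQUEST = 7671   # measured tokens per request
REQUEST_SECONDS = 0.01      # stand-in latency, scaled down from the ~6.71 s average

async def simulate_request(sem: asyncio.Semaphore) -> int:
    """Placeholder for the real API call; returns tokens consumed."""
    async with sem:                       # cap in-flight requests
        await asyncio.sleep(REQUEST_SECONDS)
        return TOKENS_PER_REQUEST

async def run_batch(total: int, concurrency: int) -> float:
    """Run `total` requests at most `concurrency` at a time; return TPM."""
    sem = asyncio.Semaphore(concurrency)
    start = time.monotonic()
    results = await asyncio.gather(
        *(simulate_request(sem) for _ in range(total))
    )
    elapsed = time.monotonic() - start
    return sum(results) / elapsed * 60    # tokens per minute

tpm = asyncio.run(run_batch(total=60, concurrency=20))
print(f"{tpm:,.0f} TPM")
```

With the real API call dropped in, varying `total` and `concurrency` is how I produced the numbers in the table below.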
It sure would be nice if OpenAI would give us some insight into this, as I see other people experiencing similar problems.
Below is a table with the results.