About gpt-realtime Rate-Limit

xu_wt · February 28, 2026, 4:25am

Response: “Rate limit reached for gpt-4o-mini-realtime in organization org-xx on requests per day (RPD): Limit 1000, Used 1000, Requested 1. Please try again in 1m26.4s. Visit xxxx to learn more.”

How to query the rate-limit value of the gpt-realtime-mini model in real time. The example shows that the ‘rate_limits.updated’ event can provide information on the consumption of requests and tokens. However, in reality, it can only return the token consumption, but not the request consumption.

Additional:

Q1:

I was using gpt-realtime-mini model. Why did it show “gpt-4o-mini-realtime” in the response?

Q2：

Does RPD refer to a single natural day (i.e., automatically refreshed at 0 o’clock) or to the past 24 hours?

Q3:

I actually have a large number of requests every day. How can I increase the upper limit of my RPD? After Tier’s upgrade, I noticed improvements in RPM and TPM, but RPD did not show any increase.

Q4:

Are the values of gpt-realtime-mini and gpt-realtime shared or independent? Also, what is gpt-realtime-1.5 ?Is it an upgraded version of GPT-Realtime or does it have slight differences in usage from GPT-Realtime?

Topic		Replies	Views
What is the api rate limit for gpt-4-1106-preview? API gpt-4 , api	3	1115	February 5, 2024
Regarding rate limit in multi model API	3	1234	October 25, 2023
Gpt-4o rate limits being 30k tpm and 90k tpd does that mean I can only do 90k tokens in a day? API rate-limit	1	407	November 23, 2024
Whats up with the 100 uses per day on 1106-preview? API gpt-4	4	2171	November 7, 2023
Tts-1 & tts-1-hd API RPM and RPD based on chosen tier API api	2	469	May 28, 2024

About gpt-realtime Rate-Limit

Related topics