Response: “Rate limit reached for gpt-4o-mini-realtime in organization org-xx on requests per day (RPD): Limit 1000, Used 1000, Requested 1. Please try again in 1m26.4s. Visit xxxx to learn more.”
How to query the rate-limit value of the gpt-realtime-mini model in real time. The example shows that the ‘rate_limits.updated’ event can provide information on the consumption of requests and tokens. However, in reality, it can only return the token consumption, but not the request consumption.
Additional:
Q1:
I was using gpt-realtime-mini model. Why did it show “gpt-4o-mini-realtime” in the response?
Q2:
Does RPD refer to a single natural day (i.e., automatically refreshed at 0 o’clock) or to the past 24 hours?
Q3:
I actually have a large number of requests every day. How can I increase the upper limit of my RPD? After Tier’s upgrade, I noticed improvements in RPM and TPM, but RPD did not show any increase.
Q4:
Are the values of gpt-realtime-mini and gpt-realtime shared or independent? Also, what is gpt-realtime-1.5 ?Is it an upgraded version of GPT-Realtime or does it have slight differences in usage from GPT-Realtime?