Hi everyone,
I’m currently integrating the GPT-4 API into my application. However, after sending a certain number of requests, I often get this error: 429 Too Many Requests.
I’ve already tried adding a sleep between requests, but the error still persists.
My questions are:
- Is there an official rate limit per minute/hour for each account?
- What are the best practices to avoid this error? Should I implement a queue system?
- Are there any differences in rate limits between GPT-4.1 and GPT-4o?
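
To make the second question more concrete, this is roughly the kind of retry wrapper I'm considering in place of a fixed sleep. It's only a minimal sketch of exponential backoff with jitter: `RateLimitError` and `send` are placeholder names I made up for whatever exception the client raises on a 429 and whatever function actually makes the request, and the default delays are guesses, not documented values.

```python
import random
import time


class RateLimitError(Exception):
    """Hypothetical stand-in for whatever the client raises on HTTP 429."""


def call_with_backoff(send, max_retries=5, base_delay=1.0, max_delay=60.0,
                      sleep=time.sleep):
    """Call send(); on RateLimitError, wait with exponential backoff and retry.

    `send` is a hypothetical zero-argument callable that performs one request.
    `sleep` is injectable so the wait can be replaced in tests.
    """
    for attempt in range(max_retries + 1):
        try:
            return send()
        except RateLimitError:
            if attempt == max_retries:
                raise  # out of retries, surface the error to the caller
            # Delay doubles on each attempt (1s, 2s, 4s, ...), capped at max_delay.
            delay = min(max_delay, base_delay * (2 ** attempt))
            # Add up to 10% random jitter so parallel workers don't retry in lockstep.
            sleep(delay + random.uniform(0, delay * 0.1))
```

I would call it as `call_with_backoff(lambda: my_request_function(prompt))`, where `my_request_function` is again just a placeholder. I'm not sure whether this alone is enough or whether it should sit behind a queue.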
If anyone has experienced something similar or has a solution, I’d really appreciate your help.
Thanks in advance!