I have recently been receiving the same error for text-davinci-003, so I ran a few tests. Happy to provide trace logs if the OAI team needs or wants them.
According to the latest docs, the rate limits for a paid account after 48 hours are:
3,000 requests / minute
250,000 davinci tokens / minute (and proportionally more for smaller models)
However, I am able to trigger a 429 error with close to 98% reliability by sending 3 or more concurrent requests to the API. Regardless of the size of the requests, as soon as 3 (sometimes 4) are being processed in parallel using the same API key, the service locks up. So far I've tried it with the Python SDK, Java, and manually using curl.
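If the trigger really is the number of requests in flight rather than the documented per-minute limits, one workaround is to cap concurrency on the client side. Here is a minimal sketch of that idea using only the standard library. `send_request` is a hypothetical placeholder for whatever function makes your actual API call, and the cap of 2 is an assumption based on the 3-request threshold observed above, not a documented limit.

```python
from concurrent.futures import ThreadPoolExecutor

# Assumption: stay just under the ~3 concurrent requests that triggered 429s.
MAX_IN_FLIGHT = 2


def run_capped(send_request, prompts, max_in_flight=MAX_IN_FLIGHT):
    """Call send_request(prompt) for every prompt, in order of submission,
    with at most max_in_flight calls running at the same time.

    send_request is a hypothetical callable supplied by the caller; it is
    not part of any SDK.
    """
    with ThreadPoolExecutor(max_workers=max_in_flight) as pool:
        # pool.map preserves input order in the returned results.
        return list(pool.map(send_request, prompts))
```

The pool size is the only thing limiting parallelism here, so raising `max_in_flight` to 3 or 4 should be an easy way to check whether the 429s come back.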
I have been getting the same error. Switching to text-davinci-002 makes it work normally.
I think the issue is specific to the text-davinci-003 model.
For me, I threw a time.sleep(3) in there so I do at most 20 requests per minute. The API limits are somewhere in the documentation; I think it's 3k calls per minute if you have a credit card linked.
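For anyone who wants the same throttle without sleeping longer than necessary, here is a rough sketch that spaces out the start of each call by a minimum interval. The `throttled` helper and its parameters are made up for illustration; the 3-second default just mirrors the `time.sleep(3)` above (60 / 3 = 20 calls per minute at most).

```python
import time


def throttled(calls, min_interval=3.0, sleep=time.sleep, clock=time.monotonic):
    """Run each zero-argument callable in order, starting consecutive calls
    at least min_interval seconds apart.

    sleep and clock are injectable so the pacing can be tested without
    actually waiting; by default they are time.sleep and time.monotonic.
    """
    results = []
    last_start = None
    for call in calls:
        if last_start is not None:
            # Only wait for whatever part of the interval has not elapsed yet.
            wait = min_interval - (clock() - last_start)
            if wait > 0:
                sleep(wait)
        last_start = clock()
        results.append(call())
    return results
```

Each entry in `calls` would wrap one API request, for example `lambda: my_request(prompt)` where `my_request` is your own function. Because the wait is measured from the start of the previous call, a slow response eats into the interval instead of adding to it.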
Just encountered the same problem on a "pay-as-you-go" plan. I definitely didn't hit any of the specified limits. Will try to contact support.