I’m using gpt-4-1106-preview.
Starting around… December 4th, 5pm ET or so, I’ve been getting closed connections from OpenAI API. Some debug output indicates that requests are no longer allowed to take more than 60 seconds. This is tricky, since,well, I need the API to do the work in order to, well, work with it.
I am sending, if I count correctly (rust tiktoken library), about 1500 tokens, but that’s not changed from what I used to do.
Is this an intentional change on the OpenAI side? A Cloudflare configuration bug? An unintended config deploy side effect from a new SRE team member? Is my account somehow capped (that would surprise me, I am not a power user by any stretch of the imagination).