I rely heavily on batching to perform some experiments.
I am currently in tier 2, but sometimes, seemingly at random, some batches fail for exceeding the token quota.
I wrote some code to estimate the input tokens of my batches and split them when required: input tokens are computed with tiktoken and include the system message and the JSON schema for structured output. I also deliberately stay below 50% of my quota, but I still get failures. A simplified sketch of the splitting logic is below.
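For reference, here is roughly what my code does (a minimal sketch, not my exact implementation: `QUOTA_TOKENS` and `SAFETY_FRACTION` are placeholder values, and each request is assumed to be one parsed line of the standard batch JSONL format with plain-string message content):

```python
import json
import tiktoken

# Placeholder values, not my real limits.
QUOTA_TOKENS = 2_000_000   # hypothetical tier-2 enqueued-token quota
SAFETY_FRACTION = 0.5      # stay below 50% of the quota

enc = tiktoken.get_encoding("o200k_base")  # encoding used by 4o-family models

def request_tokens(request: dict) -> int:
    """Estimate input tokens for one batch request, counting every
    message (system message included) plus the JSON schema used for
    structured output, since the schema is billed as input too."""
    body = request["body"]
    total = sum(len(enc.encode(m["content"])) for m in body["messages"])
    schema = body.get("response_format", {}).get("json_schema")
    if schema is not None:
        total += len(enc.encode(json.dumps(schema)))
    return total

def split_batches(requests: list[dict]) -> list[list[dict]]:
    """Greedily pack requests into sub-batches under the safety limit."""
    limit = int(QUOTA_TOKENS * SAFETY_FRACTION)
    batches: list[list[dict]] = []
    current: list[dict] = []
    current_tokens = 0
    for req in requests:
        t = request_tokens(req)
        if current and current_tokens + t > limit:
            batches.append(current)
            current, current_tokens = [], 0
        current.append(req)
        current_tokens += t
    if current:
        batches.append(current)
    return batches
```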
Many posts on this forum just say "try later, it could be past batches still in the way", but I made sure to check: everything in my batch list was old batches in the failed or cancelled state (not "cancelling").
I even tried waiting for about an hour and then launching batches that stay below 25% of my quota, still to no avail. I'm starting to believe there must be a bug.
Random thought: could the server-side token estimator be thrown off by some particular character encoding?
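If anyone wants to probe that hypothesis, here is a quick local check I could run (the sample strings are arbitrary, and I'm assuming `o200k_base` is the relevant encoding): if the server-side estimate diverged from these counts for some character set, that could explain the gap.

```python
import tiktoken

# Compare local token counts across character sets.
enc = tiktoken.get_encoding("o200k_base")

samples = {
    "ascii": "The quick brown fox jumps over the lazy dog. " * 10,
    "accented": "Žluťoučký kůň úpěl ďábelské ódy. " * 10,
    "cjk": "美しい花が庭に咲いています。" * 10,
    "emoji": "🚀🔥💡✨🎉 " * 10,
}
for name, text in samples.items():
    tokens = len(enc.encode(text))
    print(f"{name:>9}: {len(text):4d} chars -> {tokens:4d} tokens")
```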