Here a statement from the open ai head of dx:
https://x.com/romainhuet/status/1814054938986885550?t=AMFK4svMvCluYqAXUqRDMQ&s=19
So it seems like it works ( costs ) as intended.
Makes it not really usable for high volume vision tasks. Funny because it is supposed to be a high volume model.
Can recommend Gemini Flash with a fixed cost of $0.0001315 per image vs $0.005525 (768px x 1128px) for gpt-4o(mini) which is around 40 times cheaper while performing great.