OCR using API for text extraction

OpenAI artificially inflates the billed image tokens on gpt-4o-mini so that the cheaper model is no bargain for vision.

gpt-4o: 774 input tokens

gpt-4o-mini: 25510 input tokens

However, these totals also include the non-vision text in your request. To isolate the image portion, work from the actual gpt-4o figures: a fixed cost of 85 tokens (which is the whole cost at “low” detail) plus 170 tokens per tile.
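As a rough check, here is a minimal sketch that runs the two billed totals above through the 85 + 170-per-tile formula. The tile count of 4 is my assumption (it leaves 9 tokens for the text part), not something the API reported, and the function name is just illustrative.

```python
# gpt-4o image token formula, as quoted above
BASE_TOKENS = 85        # fixed cost per image (the full cost at "low" detail)
TOKENS_PER_TILE = 170   # added per tile at "high" detail

def gpt4o_image_tokens(tiles: int) -> int:
    """Image-only token cost on gpt-4o for a given tile count."""
    return BASE_TOKENS + TOKENS_PER_TILE * tiles

# Billed input totals reported above (image + text)
GPT4O_BILLED = 774
MINI_BILLED = 25510

ASSUMED_TILES = 4  # assumption: inferred from the totals, not reported

image_tokens = gpt4o_image_tokens(ASSUMED_TILES)   # image-only cost on gpt-4o
text_tokens = GPT4O_BILLED - image_tokens          # remainder is the text part
mini_image_tokens = MINI_BILLED - text_tokens      # same text, so the rest is image
amplification = mini_image_tokens / image_tokens   # how much mini inflates the image

print(f"gpt-4o image tokens:      {image_tokens}")
print(f"text tokens (remainder):  {text_tokens}")
print(f"gpt-4o-mini image tokens: {mini_image_tokens}")
print(f"amplification on mini:    {amplification:.2f}x")
```

Under that assumption the image is billed at roughly 33 times as many tokens on mini as on gpt-4o.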

The amplification is not spelled out anywhere; you have to discover it yourself with the image calculator on the API pricing page. For mini, it produces the same image price as gpt-4o-2024-05-13 and double the price of the cheaper gpt-4o-2024-08-06.
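The price comparison can be sketched as below. The per-million input prices are my assumptions from the pricing page at the time of writing and may have changed; the image token counts are the billed totals above minus an assumed 9 tokens of text.

```python
# Assumed input prices in USD per 1M tokens (check the current pricing page)
PRICE_PER_MILLION = {
    "gpt-4o-2024-05-13": 5.00,
    "gpt-4o-2024-08-06": 2.50,
    "gpt-4o-mini": 0.15,
}

# Image-only tokens: billed totals (774 and 25510) minus an assumed 9 text tokens
IMAGE_TOKENS = {
    "gpt-4o-2024-05-13": 765,
    "gpt-4o-2024-08-06": 765,
    "gpt-4o-mini": 25501,
}

def image_cost(model: str) -> float:
    """USD cost of the image portion of one request."""
    return IMAGE_TOKENS[model] * PRICE_PER_MILLION[model] / 1_000_000

costs = {model: image_cost(model) for model in PRICE_PER_MILLION}
for model, cost in costs.items():
    print(f"{model:20s} ${cost:.6f} per image")

# Ratios of mini's image price to each gpt-4o snapshot
ratio_vs_0513 = costs["gpt-4o-mini"] / costs["gpt-4o-2024-05-13"]
ratio_vs_0806 = costs["gpt-4o-mini"] / costs["gpt-4o-2024-08-06"]
print(f"mini vs 05-13: {ratio_vs_0513:.2f}x, mini vs 08-06: {ratio_vs_0806:.2f}x")
```

With those assumed prices, the inflated token count almost exactly cancels mini's lower per-token rate.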

It is even tricky enough to force you onto a different calculator for mini, instead of letting you quickly switch models.

Mini actually provides the more satisfying answer, instead of “look it up yourself, pal”, even though the smaller AI model is the one you should have less confidence in actually knowing.