Token Usage for Images Remains Constant Regardless of Size - Is This a Bug?

_j · September 22, 2024, 7:11pm

I just did a deep dive into what you can expect for token usage (and rate usage) for a variety of resolutions, detail settings, and models.

If the image resolution at detail:high takes the same number of tiles, the cost will be the same. This means anything from 513x513 to 1024x1024, or anything in between, results in 4 overlay tiles (on top of a base “low” image.)

There are also peculiarities in the internal downsizing even on detail:high. Your image will be downsized so the shortest dimension is at most 768 pixels. Send 3000x3000, the model sees 768x768 - 4 tiles of 512x512. Send 2000x500, the model sees 2000x500, also 4 tiles of 512x512.

Topic		Replies	Views
Help understand token usage with vision API API gpt-4-vision	3	567	September 13, 2024
GPT-4-o-Mini Vision Token Cost Issue API gpt-4-vision , cost	1	378	October 23, 2024
Vision token counts does not correspond to the documentation Bugs token , api-vision	3	58	December 30, 2024
Are the vision tokens added to the tokens per request limit? API	4	156	September 16, 2024
One request costs 153491 input tokens Prompting gpt-4o-mini	9	214	January 13, 2025

Token Usage for Images Remains Constant Regardless of Size - Is This a Bug?

Related topics