Clarification on Token Usage for Image Inputs

Where is the token usage for image inputs visible? If I look at the OpenAI dashboard, does the “images” token usage graph only refer to image outputs and not inputs?

Thanks! Sorry if this is basic, I’m a newbie!


Welcome to the forum. No worries, we’re all newbies.

When images are used as inputs, they are converted into tokens; how many tokens depends on the model.

On the pricing page, scroll down to the bottom and you will find a calculator tucked into the FAQ section:

How is pricing calculated for images?

Images are converted into tokens and charged per token. Text models price image tokens at standard text token rates, while GPT Image and gpt-realtime use a separate image token rate. Models like gpt-4.1-mini, gpt-4.1-nano, and o4-mini convert images into tokens differently. Learn more in our docs.


Answer: vision input is not exposed or metered separately anywhere. Image input tokens are simply counted as part of the regular input tokens.


You get to see “audio” in chat completions usage because it is billed at a different rate per token.
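To make that concrete, here is a minimal sketch of how a usage object breaks down. The field names follow the Chat Completions usage layout as I understand it (`prompt_tokens_details` with `cached_tokens` and `audio_tokens`, `completion_tokens_details` with `reasoning_tokens`); treat them as assumptions and check them against a real response. The helper `split_usage` is hypothetical, and the numbers are borrowed from the detail:low test further down rather than from a real Chat Completions call. Note there is simply no image field to read.

```python
def split_usage(usage):
    """Break a usage dict into the categories that are reported separately.

    Assumed layout (verify against a real API response):
      usage["prompt_tokens"], usage["completion_tokens"],
      usage["prompt_tokens_details"]["cached_tokens" | "audio_tokens"],
      usage["completion_tokens_details"]["reasoning_tokens"]
    """
    prompt_details = usage.get("prompt_tokens_details") or {}
    completion_details = usage.get("completion_tokens_details") or {}

    cached = prompt_details.get("cached_tokens", 0)
    audio_in = prompt_details.get("audio_tokens", 0)
    reasoning = completion_details.get("reasoning_tokens", 0)

    return {
        "uncached_input": usage["prompt_tokens"] - cached,
        "cached_input": cached,
        "audio_input": audio_in,
        "reasoning_output": reasoning,
        "non_reasoning_output": usage["completion_tokens"] - reasoning,
        # No "image_input" key: image tokens are folded into prompt_tokens.
    }


# Illustrative values only, copied from the detail:low test in this thread.
example_usage = {
    "prompt_tokens": 87,
    "completion_tokens": 57,
    "prompt_tokens_details": {"cached_tokens": 0, "audio_tokens": 0},
    "completion_tokens_details": {"reasoning_tokens": 0},
}

breakdown = split_usage(example_usage)
print(breakdown)
```

The only way to see what an image cost is to compare the input token count with and without the image, or to estimate it from the dimensions.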

I made a web utility to make the calculation a hair more transparent, and you can enter dimensions or try images:

Discovery

(also ensuring that gpt-5 vision is not overbilling for “low”)

detail:low, 512x513

— Testing
The image is a high-contrast black-and-white checkerboard pattern. It consists of an even grid of alternating black and white squares arranged in rows and columns, with each square the same size. The pattern repeats uniformly across the entire image.

input tokens: 87 output tokens: 57
uncached: 87 non-reasoning: 57
cached: 0 reasoning: 0

detail:high, 512x513

— Testing
The image shows a black-and-white checkerboard pattern filling the entire frame. Squares of equal size alternate between black and white in both rows and columns, creating a grid of repeating checks.

input tokens: 367 output tokens: 46
uncached: 367 non-reasoning: 46
cached: 0 reasoning: 0
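For anyone who wants to reproduce numbers like these by hand, below is a rough sketch of the tile-based estimate as I understand it from the vision docs: low detail costs a flat base amount; high detail scales the image down to fit 2048x2048, scales down again so the shortest side is at most 768, then charges per 512px tile plus the base. The function name `estimate_image_tokens` is hypothetical. The defaults of 85 base and 170 per tile are what I believe applies to the gpt-4o family, and the 70 / 140 pair used for gpt-5 in the usage note is likewise an assumption, so check both against the current docs. The mini and nano models mentioned in the FAQ use a different scheme that this does not cover.

```python
import math


def estimate_image_tokens(width, height, detail="high",
                          base_tokens=85, tile_tokens=170):
    """Estimate image input tokens with the tile-based scheme.

    base_tokens / tile_tokens are per-model constants. The defaults are an
    assumption for the gpt-4o family; pass other values for other models.
    """
    if detail == "low":
        # Low detail is a flat cost regardless of image size.
        return base_tokens

    # Step 1: scale down (never up) to fit inside a 2048 x 2048 square.
    scale = min(1.0, 2048 / max(width, height))
    w, h = width * scale, height * scale

    # Step 2: scale down (never up) so the shortest side is at most 768.
    scale = min(1.0, 768 / min(w, h))
    w, h = w * scale, h * scale

    # Step 3: count the 512 x 512 tiles needed to cover the image.
    tiles = math.ceil(w / 512) * math.ceil(h / 512)
    return base_tokens + tile_tokens * tiles


# The 512x513 test image with the assumed gpt-5 constants (70 base, 140 per tile).
low = estimate_image_tokens(512, 513, "low", base_tokens=70, tile_tokens=140)
high = estimate_image_tokens(512, 513, "high", base_tokens=70, tile_tokens=140)
print(low, high, high - low)
```

With those assumed constants, the extra pixel row in 512x513 pushes the image to two tiles, giving 70 for low and 350 for high. The difference of 280 matches the gap between the 87 and 367 input tokens reported above; the remaining 17 tokens in both runs would then be the text prompt and message overhead.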

“Images” as a usage category is for generations by DALL-E models or gpt-image-1, dedicated to making AI pictures.


@_j @aprendendo.next

Well, all I can say is that gpt-image-1 edits are way too expensive. Actually, this is the only beef I have with OpenAI so far…


True… I hope the successor to gpt-image-1 gets cheaper, and perhaps a bit faster. The competition is getting pretty close lately.
