In the documentation (https://platform.openai.com/docs/guides/images#cost-calculation-examples) there is an example claiming that a 1024x1024px image passed to GPT-4.1 should consume 1024 input tokens. However, if I generate and pass an image of that size to the API, the response JSON reports 772 prompt tokens (see test code below). I believe that either the documentation or the calculation in the API is incorrect.
The 772 figure closely matches the documented GPT-4o figure of 765, and in fact if I run the same test with the model switched to GPT-4o I get an identical 772.
Meanwhile, if I run the same test with GPT-4.1-mini or GPT-4.1-nano, I get results very close to the documented 1024x1.62 and 1024x2.46 respectively. So only GPT-4.1 seems to deviate from the documentation; GPT-4o, GPT-4.1-mini and GPT-4.1-nano do not.
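For reference, here is a sketch of the two costing formulas as I read the docs: the tile-based rule used by GPT-4o (and, per the docs' example, supposedly not GPT-4.1) at "high" detail, and the 32px-patch rule used by the mini/nano models. The constants (85 base + 170 per 512px tile; 1536-patch cap) are from the documentation; the scaling steps are my interpretation of it, so treat this as illustrative, not authoritative:

```python
import math

def tile_tokens(width, height, base=85, per_tile=170):
    # "High" detail, tile-based rule (GPT-4o per the docs):
    # 1. scale the image to fit within 2048x2048
    scale = min(1.0, 2048 / max(width, height))
    width, height = width * scale, height * scale
    # 2. scale so the shortest side is at most 768px
    scale = min(1.0, 768 / min(width, height))
    width, height = width * scale, height * scale
    # 3. count 512px tiles, then base + per-tile cost
    tiles = math.ceil(width / 512) * math.ceil(height / 512)
    return base + per_tile * tiles

def patch_tokens(width, height, cap=1536):
    # Patch-based rule (GPT-4.1-mini/nano per the docs):
    # count 32x32px patches, capped at 1536; the model-specific
    # multiplier (1.62, 2.46, ...) is applied on top of this.
    return min(cap, math.ceil(width / 32) * math.ceil(height / 32))

print(tile_tokens(1024, 1024))   # 765 -- matches the GPT-4o docs figure
print(patch_tokens(1024, 1024))  # 1024 -- the base for the mini/nano multipliers
```

A 1024x1024 image gives 765 under the tile rule, which is within a few tokens of the 772 I observe for GPT-4.1 (the small gap is presumably message-wrapping overhead), and 1024 patches under the patch rule, matching the documented mini/nano bases.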
Any suggestions appreciated.
Reproducible test as of today:
import base64

import PIL.Image
from openai import OpenAI

client = OpenAI()

def generate_solid_color_png(width, height, color, output_path):
    """
    Generates a solid-colour PNG.
    - width, height: dimensions in pixels
    - color: colour (e.g. "#RRGGBB" or (R, G, B))
    - output_path: where to save the PNG
    """
    image = PIL.Image.new("RGB", (width, height), color)
    image.save(output_path, format="PNG")
def send_image_for_completion(width, height, color, model="gpt-4.1", detail="high"):
    # 1. Create the PNG
    tmp = "temp.png"
    generate_solid_color_png(width, height, color, tmp)
    # 2. Read & Base64-encode
    with open(tmp, "rb") as f:
        b = f.read()
    data_url = "data:image/png;base64," + base64.b64encode(b).decode("utf-8")
    # 3. Wrap it in a content block
    response = client.chat.completions.create(
        model=model,
        messages=[
            {
                "role": "user",
                "content": [
                    {
                        "type": "image_url",
                        "image_url": {
                            "url": data_url,
                            "detail": detail
                        }
                    }
                ]
            }
        ]
    )
    # 4. Inspect usage
    print("Prompt tokens:", response.usage.prompt_tokens)
send_image_for_completion(1024, 1024, '#00FF00')
Outputs: Prompt tokens: 772