300 tokens for a 512x512 image at "detail: low"

boolcount · February 11, 2025, 9:47pm

I'm using the sample from the documentation (https://platform.openai.com/docs/guides/vision):

from openai import OpenAI
client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "What's in this image?"},
                {
                    "type": "image_url",
                    "image_url": {
                        "url": "https://my.site....",
                        "detail": "low",
                    },
                },
            ],
        }
    ],
    max_tokens=10,
)

print(response.choices[0].message.content)

I use this image (512x512).

But in the response I see:

prompt_tokens: 303
completion_tokens: 10
total_tokens: 313
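
For anyone reproducing this, the counters above are read from the `usage` field of the response object. A minimal sketch of printing them follows. The `format_usage` helper is a hypothetical name, not part of the SDK, and the `SimpleNamespace` is only a stand-in so the snippet runs without an API call:

```python
from types import SimpleNamespace


def format_usage(usage):
    """Render the three token counters as 'name: value' lines."""
    fields = ("prompt_tokens", "completion_tokens", "total_tokens")
    return "\n".join(f"{name}: {getattr(usage, name)}" for name in fields)


# Stand-in for response.usage (values copied from this post);
# with the real client you would pass response.usage instead.
usage = SimpleNamespace(prompt_tokens=303, completion_tokens=10, total_tokens=313)
print(format_usage(usage))
```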

Why 303 tokens?

On the same page, the documentation says a 512x512 image at “low detail” costs 85 tokens:
https://platform.openai.com/docs/guides/vision

Why 303 tokens?
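
To make the gap concrete, here is a small sketch of the arithmetic. It assumes the 85-token figure quoted from the guide is the full image cost at "detail: low". The function name is hypothetical, and the remainder it computes also includes the text prompt and any per-message overhead, which I have not measured separately:

```python
# Flat image cost at "detail: low", as quoted from the vision guide in this post.
LOW_DETAIL_IMAGE_TOKENS = 85


def tokens_beyond_image(prompt_tokens, image_tokens=LOW_DETAIL_IMAGE_TOKENS):
    """Prompt tokens left over after subtracting the documented image cost."""
    return prompt_tokens - image_tokens


# prompt_tokens reported by the API for the request above
remainder = tokens_beyond_image(303)
print(remainder)  # 218
```

So 218 prompt tokens would have to come from the one-line text question and message formatting, which seems far too many for "What's in this image?".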
