Unexpected High Token Usage on OpenAI API

Hi, I’m seeing unusually high token usage and am looking for insights.

Current Usage Status:

  • Dashboard total tokens: 77,720
  • Input tokens: 77,056
  • Output tokens: 664

Actual API Requests (curl commands):

  1. First Chat Completion Request:
curl https://api.openai.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer [REDACTED]" \
  -d '{
    "model": "gpt-4o-mini",
    "messages": [
      {
        "role": "user",
        "content": [
          {
            "type": "text",
            "text": "Analyze the image and extract a prompt ..."
          },
          {
            "type": "image_url",
            "image_url": {
              "url": "..."
            }
          }
        ]
      }
    ],
    "max_tokens": 300
  }'
  2. Second Chat Completion Request:
curl https://api.openai.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer [REDACTED]" \
  -d '{
    "model": "gpt-4o-mini",
    "messages": [
      {
        "role": "user",
        "content": [
          {
            "type": "text",
            "text": "Analyze the image and extract a prompt ..."
          },
          {
            "type": "image_url",
            "image_url": {
              "url": "..."
            }
          }
        ]
      }
    ],
    "max_tokens": 300
  }'
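For reference, every chat completions response includes a `usage` object with the billed token counts, and that's how I arrived at the per-request averages below. A minimal Python sketch of the tally I've been keeping (the sample response values here are made up for illustration):

```python
# Every /v1/chat/completions response carries a `usage` object with the
# billed token counts for that request. Tallying it per request is the
# simplest way to reconcile against the dashboard.
# The sample response below is made up for illustration.
sample_response = {
    "model": "gpt-4o-mini",
    "usage": {"prompt_tokens": 980, "completion_tokens": 45, "total_tokens": 1025},
}

totals = {"prompt_tokens": 0, "completion_tokens": 0, "total_tokens": 0}

def tally(response: dict) -> None:
    """Add one response's `usage` counts to the running totals."""
    for key in totals:
        totals[key] += response["usage"][key]

tally(sample_response)
print(totals)  # {'prompt_tokens': 980, 'completion_tokens': 45, 'total_tokens': 1025}
```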

Observations:

  • The token counts from my logged requests add up to far less than the dashboard total
  • The requests were almost entirely chat completions
  • By my tally, each request averaged around 1,000 tokens

Questions for Community:

  • Has anyone experienced similar unexpected token consumption?
  • Are there known billing quirks with recent API updates?
  • Recommendations for accurate usage tracking?

Appreciate any insights or experiences you can share.

Welcome to the community!

This has been a very common point of confusion.

The problem is that you’re using the mini model: gpt-4o-mini bills roughly 33x more tokens per image than gpt-4o, which cancels out its per-token price advantage for vision inputs. A single high-detail image request can therefore consume tens of thousands of input tokens.


I couldn’t find this behavior mentioned anywhere on the pricing page (https://openai.com/api/pricing/) - the only way you could potentially figure it out is to use their vision pricing calculator and back-calculate the token counts from the cost it shows.

The cost calculation in the API docs (https://platform.openai.com/docs/guides/vision#calculating-costs) seems to be completely irrelevant for the mini model :confused:
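Back-calculating from that calculator, the high-detail image accounting appears to be: scale the image to fit within 2048x2048, scale its shortest side down to 768px, count 512px tiles, then charge a base amount plus a per-tile amount. A rough Python sketch - the gpt-4o numbers match the docs, while the gpt-4o-mini numbers are what the calculator implies, so treat them as approximate:

```python
import math

# Base and per-tile token counts for "high" detail images. The gpt-4o
# figures are from the vision docs; the gpt-4o-mini figures are
# back-calculated from the pricing calculator (roughly 33x gpt-4o),
# so treat them as approximate.
IMAGE_TOKEN_RATES = {
    "gpt-4o":      {"base": 85,   "per_tile": 170},
    "gpt-4o-mini": {"base": 2833, "per_tile": 5667},
}

def image_tokens(width: int, height: int, model: str = "gpt-4o-mini") -> int:
    """Estimate input tokens billed for one high-detail image."""
    # Scale down to fit within a 2048 x 2048 square.
    scale = min(1.0, 2048 / max(width, height))
    width, height = width * scale, height * scale
    # Scale so the shortest side is at most 768px.
    scale = min(1.0, 768 / min(width, height))
    width, height = width * scale, height * scale
    # Count 512px tiles and apply the model's rates.
    tiles = math.ceil(width / 512) * math.ceil(height / 512)
    rates = IMAGE_TOKEN_RATES[model]
    return rates["base"] + rates["per_tile"] * tiles

print(image_tokens(1024, 1024, "gpt-4o"))       # 765, matching the docs' example
print(image_tokens(1024, 1024, "gpt-4o-mini"))  # 25501
```

If those mini numbers are right, two or three image requests like yours would account for most of the ~77k input tokens on your dashboard.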
