Confusion Regarding Tokenization Calculation in Realtime API and Potential Double Charging Concerns

eric797 · October 8, 2024, 6:18pm

Many people complain about expensive charge on realtime API, I also confused about tokenization calculation in API.

I got the first response message from realtime API: “Hi there! How can I assist you today?”

It says 20 output tokens.

      "output_token_details": {
        "text_tokens": 20,
        "audio_tokens": 47
      }

but it is 10 by this tool https://platform.openai.com/tokenizer.

They doubled it, and double charge us?

Attached is an event of “response.done” from realtime API

{
  "type": "response.done",
  "event_id": "****",
  "response": {
    "object": "realtime.response",
    "id": "*****",
    "status": "completed",
    "status_details": null,
    "output": [
      {
        "id": "*****",
        "object": "realtime.item",
        "type": "message",
        "status": "completed",
        "role": "assistant",
        "content": [
          {
            "type": "audio",
            "transcript": "Hi there! How can I assist you today?"
          }
        ]
      }
    ],
    "usage": {
      "total_tokens": 577,
      "input_tokens": 510,
      "output_tokens": 67,
      "input_token_details": {
        "cached_tokens": 0,
        "text_tokens": 510,
        "audio_tokens": 0
      },
      "output_token_details": {
        "text_tokens": 20,
        "audio_tokens": 47
      }
    }
  }
}

anon22939549 · October 8, 2024, 6:53pm

There are two things at play here,

They appear to possibly be using a slightly different tokenizer than o200k_base. This tokenizer requires about 30% more tokens for the same amount of text.
You need to include the control tokens which delineate which messages come from which entity (system, user, assistant). These are only on the order of about 4 or so tokens for each message, but on small messages they represent a sizable proportion of the total tokens.

In short, no, you aren’t being double billed.

jeffsharris · October 15, 2024, 3:12pm

Would love more info if you’re continuing to see problems. One thing that’s helpful to check is using the Playground, you can see the total tokens of each type consumed during your entire session at the top of the Logs

Omar_Atef · May 5, 2025, 11:03am

I just said with my voice to my MICOPHONE hello!!
where do these text 11800 tokens come from??

Topic		Replies	Views
Huge tokens charging for text input using Realtime API while only AUDIO input! API	0	55	May 5, 2025
Help me understand the true cost of the RealTime API API api , realtime	2	1217	March 26, 2025
Confusion Between Per-Minute Audio Pricing vs. Token-Based Audio Pricing API realtime	3	5512	December 30, 2024
Realtime API cost mismatch between the bill and the calculated cost API realtime	1	236	May 14, 2025
Lets break down the input/output token details together! API realtime	3	1296	October 6, 2024

Confusion Regarding Tokenization Calculation in Realtime API and Potential Double Charging Concerns

Related topics