How to track complimentary tokens?

dreamscapemexico · March 22, 2025, 8:15pm

Hello everyone. Few days ago i saw new offer from openai to developers. which offer me “Get free usage of up to 250 thousand tokens per day across gpt-4.5-preview, gpt-4o and o1, and up to 2.5 million tokens per day across gpt-4o-mini, o1-mini and o3-mini on traffic shared with OpenAI through April 10, 2025. Usage beyond these limits, as well as usage for other models, will be billed at standard rates” which is super cool, but.

How i can track this complimentary tokens? it says look at pricing page, but there is nothing helpfull, only normal usage pricing.

Lisa-Marie · March 23, 2025, 8:51am

Yeah, I also wonder. I signed up for the same program but so far could not find any information where to track the token usage and how many tokens are available for each model.

What I like that it gives the opportunity to explore the expensive GPT 4.5 … very few clients seem interested in deploying this expensive, big and therefore slow model. However in my tests so far I was indeed quite surprised about its emotionall intelligence which could allow for use cases in health application, as AI therapist / coach / lifestyle trainer / trauma cauncellor and could be build into Apps that doctors can use to prescribe digital health applications, for example an assisants that helps with keeping a diet or tasks from a therapist, powered by GPT 4.5 (Turbo?) … and payed for by the insurance. This an application I can see and it is pretty cool that I can try GPT 4.5 for free but I would love to know how many tokens can I process with GPT 4.5 within the the allowed free tier until this programs run?

Any info is much appreciated. Thanks!

_j · March 23, 2025, 2:09pm

You can go to the legacy usage dashboard, go to activity, pick ALL the models of a particular free usage class (1M or 10M mini), and then get a general idea of your usage patterns, as the token counts there show up even though the usage is free:

However, besides the considerable lag, you cannot “track usage”, which is billed when you exceed the threshold in any 24 hour period.

Tracking? Start here:

print(json.dumps(json.loads(response_completed_event)["response"]["usage"], indent=2))

{
  "input_tokens": 383,
  "input_tokens_details": {
    "cached_tokens": 0
  },
  "output_tokens": 72,
  "output_tokens_details": {
    "reasoning_tokens": 0
  },
  "total_tokens": 455
}

The calls that got me something to show you also fulfilled an add-on to log usage from your API call response object (needing adaptation to the particular API call method, endpoint, or SDK plus way you’d store), and then a utility for the free usage considered right now.

Complementary usage logging

This is an AI brainstorm, expect form, not working code.

Recommended Implementation Strategy:

Logging: Append each API call’s usage data as a JSON line to a log file (free_calls_log.txt). JSON lines (.jsonl) format is ideal for easy parsing and appending.
Utility: A standalone script to parse the log, filter entries within the last 24 hours, sum tokens, and optionally clean expired entries.

1. Logging Function (to append usage data):

import json
from pathlib import Path
from openai.types.chat.chat_completion import ChatCompletion

LOG_FILE = Path("free_calls_log.txt")

def log_usage(response: ChatCompletion) -> None:
    """Append usage data from OpenAI response to log file."""
    entry = {
        "created_at": response.created,  # UNIX timestamp
        "model": response.model,
        "usage": {
            "input_tokens": response.usage.prompt_tokens,
            "output_tokens": response.usage.completion_tokens,
            "total_tokens": response.usage.total_tokens,
        }
    }
    with LOG_FILE.open("a", encoding="utf-8") as f:
        f.write(json.dumps(entry) + "\n")

2. Utility Script (to calculate usage and optionally clean expired entries):

import json
import time
from pathlib import Path
from typing import Set

LOG_FILE = Path("free_calls_log.txt")
WINDOW_SECONDS = 86400  # 24 hours

# Define model groups explicitly
HIGH_CAP_MODELS: Set[str] = {"gpt-4o", "gpt-4.5-preview", "o1"}
LOW_CAP_MODELS: Set[str] = {"gpt-4o-mini", "o1-mini", "o3-mini"}

def get_model_group(model_name: str) -> str | None:
    """Identify model group based on model name."""
    for prefix in HIGH_CAP_MODELS:
        if model_name.startswith(prefix):
            return "high"
    for prefix in LOW_CAP_MODELS:
        if model_name.startswith(prefix):
            return "low"
    return None  # Model not in free usage groups

def calculate_usage(clean_expired: bool = False) -> dict[str, int]:
    """Calculate total token usage within the last 24 hours."""
    now = int(time.time())
    cutoff = now - WINDOW_SECONDS
    totals = {"high": 0, "low": 0}
    valid_entries = []

    with LOG_FILE.open("r", encoding="utf-8") as f:
        for line in f:
            entry = json.loads(line)
            created_at = entry["created_at"]
            if created_at >= cutoff:
                model_group = get_model_group(entry["model"])
                if model_group:
                    totals[model_group] += entry["usage"]["total_tokens"]
                valid_entries.append(entry)

    if clean_expired:
        with LOG_FILE.open("w", encoding="utf-8") as f:
            for entry in valid_entries:
                f.write(json.dumps(entry) + "\n")

    return totals

if __name__ == "__main__":
    usage = calculate_usage(clean_expired=True)
    print(f"Usage in last 24 hours:")
    print(f"High-cap models (1M/day): {usage['high']} tokens")
    print(f"Low-cap models (10M/day): {usage['low']} tokens")

Usage Example:

Logging (after each API call):

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Hello"}]
)
log_usage(response)

Checking Usage (run periodically or manually):

python calculate_usage.py

This implementation provides:

Efficient append-only logging.
Fast parsing and filtering by timestamp.
Optional cleanup of expired entries.
Clear separation of model groups and their respective caps.

bhoover · March 23, 2025, 5:58pm

You can read more about the program here - scroll to the bottom: https://help.openai.com/en/articles/10306912-sharing-feedback-evals-and-api-data-with-openai

Please note that different usage tiers (Tiers 1-2 v. 3-5) have different per-day token limits.

S_P1 · July 24, 2025, 4:52pm

Hopefully someone can help explain, as a Tier-3 user I am nowhere near the daily usage limit and yet the usage page is showing is showing a charge (i’m using o3-mini and o4-mini). Please help.

_j · July 24, 2025, 5:05pm

Seems like the first step is to use “enabled for all projects”, so you make maximum use of the free daily tokens, and don’t have multiple projects needing further enrollment. The exception is if you are running user applications promising not to train on proprietary data, or if you want to maximize the use of “tokens” to those models that are more expensive, by enabling it only in particular projects.

There is no direct tracking of free consumption. You can only look at the combined call count and infer, as usage billing in dollars only shows what is billed. You can use “per project” views to see when you are going over if you have segregated that free use.

The “day” now resets at 00:00 UTC, instead of being a sliding 24hr window.

S_P1 · July 24, 2025, 5:16pm

Thanks for the response. I only have one project “Default” and am aware of the UTC reset. It was working properly for the default project until today. The only 2 models being charged are o4-mini and o3-mini (the ones I am experimenting on). Any additional advice would be appreciated.

_j · July 24, 2025, 5:22pm

Definitely hit the “enabled for all projects” then.

Track the usage object returned in every API call.

Just a few chat API calls, especially with internal tools, retrieval, and reasoning, can now blow through 1M tokens pretty quickly. The paradigm is now “models that generate a lot and continue to make iterative calls”: internal speculative context generation at high token-per-second instead of large skilled models.

You can certainly report “whoa, bug” if the free usage seems to not be working at all. Make some large calls against your 10M tokens of gpt-4.1-mini and other mini models not currently being tested, and see if that separate pool is also billed.

S_P1 · July 24, 2025, 6:56pm

So, this is not a bug. I figured out that my code was not passing in the project id and I had improperly assumed that the no project id meant default. Thanks for your help.

Topic		Replies	Views
Issue Understanding Free Tier Token Usage and Billing in OpenAI Dashboard API api	3	1009	May 20, 2025
Script to track free token usage on traffic shared with OpenAI? API	1	198	April 5, 2025
How to Determine Token Usage Per Call and Total Post-Run in OpenAI API? API	0	656	May 15, 2024
I received a few million tokens for free usage but I can't see any trace of it Community api-billing , api-billing-problem	1	178	March 30, 2025
How can i check OpenAI usage with Python? API	16	22230	July 29, 2023

How to track complimentary tokens?

Recommended Implementation Strategy:

1. Logging Function (to append usage data):

2. Utility Script (to calculate usage and optionally clean expired entries):

Usage Example:

Related topics