I’d like to raise some concerns and report possible bugs I’ve encountered with the current evals system.
According to the stated policy, users are supposed to receive 7 free weekly evals (excluding tool-use models). However, I was billed for some eval runs even though I hadn’t exceeded this weekly limit. Since I use a wide range of models, including GPT-4.5, these runs have sometimes incurred unexpectedly high costs. I’ve also observed that billing sometimes begins partway through a run; I’m not sure whether there is a token limit for a single eval run, or whether this is simply a delay in the billing process.
Overall, the program feels quite opaque. As an academic researcher working on evals, I was genuinely excited about this program. However, not being able to see how many free evals remain, or what each run costs, makes it very difficult to manage my usage and avoid unforeseen charges. Unfortunately, this lack of transparency has already resulted in fees of about $1,000, which is a significant amount for a PhD student.
I believe greater clarity here would benefit both users and OpenAI. I hope these issues can be addressed to make the system more accessible and user-friendly for the research community.