Hello!
I have been searching in vain for more details concerning the pricing of the Evals tooling – either via API/github repo, or via the playground. I understand that any tokens consumed are priced at the “regular” rate (although the playground doesn’t specify which model you are using) but it is not clear to me whether there is an additional charge to run a single instance of an eval on top of the cost of token usage.
This is an important variable cost metric for our startup business, as we may look at “rolling our own” by leveraging RAGAS or something if it proves too cost prohibitive to call OpenAI Evals at scale. More importantly, I would like to avoid a scenario in which we build a scalable OpenAI Evals integration and find a bunch of unforeseen charges beyond the token usage cost.
Please lmk! Am impressed by the tooling so far, but the economics need to make sense for us.
Best,
Zack