I’m doing some testing to see if 3.5 Turbo is financially viable for my use case, and I have some confusion about the cost I’m seeing. Specifically, it seems there’s a discrepancy between the cost compared to the final official token usage.
The token usage is 48k input and 292 output. For all of my testing I have been using the new 0125 model, which is supposed to be $0.0005 per 1k input tokens. The thing is, my cost for today is $0.13, which would make it $0.0027 per 1k input tokens, which is 5x the advertised cost.
The thing is, I know I generated more than 292 output tokens today. Is it possible that the discrepancy could be there, and that the usage statistics aren’t accurate for what I’m actually using? If so, how are we supposed to do any type of testing or estimations of future costs?
What have been your experiences with pricing of the new model, and just pricing discrepancies in general?