Hi there, I ran a test with ‘gpt-3.5-turbo’ and made 403 API requests totaling 142k context/input tokens and 73k generated tokens. The bill says $0.36.
The pricing page says ‘gpt-3.5-turbo-0125’ costs $0.0005/1k input and $0.0015/1k output. With these numbers I arrive at $0.16, not $0.36. I only get $0.36 if I use the prices for ‘gpt-3.5-turbo-instruct’ ($0.0015/1k, $0.002/1k).
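For anyone who wants to redo the arithmetic, here is a minimal sketch using only the rounded token counts and the per-1k prices quoted above. The helper name `estimate_cost` is made up for illustration, and the exact token counts on the bill are presumably a bit different from the rounded ones.

```python
# Token counts from the post (rounded to the nearest thousand).
INPUT_TOKENS = 142_000
OUTPUT_TOKENS = 73_000

# Prices in USD per 1k tokens (input, output), as quoted in the post.
PRICES_PER_1K = {
    "gpt-3.5-turbo-0125": (0.0005, 0.0015),
    "gpt-3.5-turbo-instruct": (0.0015, 0.002),
}


def estimate_cost(input_tokens, output_tokens, price_in, price_out):
    """Return the cost in USD for prices given per 1k tokens (hypothetical helper)."""
    return input_tokens / 1000 * price_in + output_tokens / 1000 * price_out


for model, (price_in, price_out) in PRICES_PER_1K.items():
    cost = estimate_cost(INPUT_TOKENS, OUTPUT_TOKENS, price_in, price_out)
    print(f"{model}: ${cost:.2f}")
```

With the rounded counts, the -0125 rates come out at roughly $0.18 (close to the $0.16 I got) and the higher rates at roughly $0.36, which is what the bill shows.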
What really confused me is that I cannot call the OpenAI endpoint with ‘gpt-3.5-turbo-0125’; it returns:
ValueError: Unknown model ‘gpt-3.5-turbo-0125’. Please provide a valid OpenAI model name in:
[…]
gpt-3.5-turbo
gpt-3.5-turbo-16k
gpt-3.5-turbo-1106
gpt-3.5-turbo-0613
gpt-3.5-turbo-16k-0613
gpt-3.5-turbo-0301
gpt-35-turbo-16k
gpt-35-turbo
gpt-35-turbo-1106
gpt-35-turbo-0613
gpt-35-turbo-16k-0613
[…]
So what is going on here? Is the pricing page outdated, is the backend out of sync with the pricing page, or am I doing something wrong?
Took a while to catch on, but:
OpenAI also just updated their pricing page so prices can be viewed in units of 1M tokens…
You have to scroll down to find the higher prices of “older models” in their own section.
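To compare the per-1k figures from the question with a page that lists prices per 1M tokens, the conversion is just a factor of 1000. A quick sketch (the function name is only illustrative, and the prices are the ones quoted in the question, not values read from the current page):

```python
def per_1k_to_per_1m(price_per_1k: float) -> float:
    """Convert a USD price per 1k tokens to USD per 1M tokens."""
    return price_per_1k * 1000


# Per-1k prices quoted in the question: (input, output).
quoted = {
    "gpt-3.5-turbo-0125": (0.0005, 0.0015),
    "higher rate that matches the bill": (0.0015, 0.002),
}

for label, (p_in, p_out) in quoted.items():
    print(
        f"{label}: ${per_1k_to_per_1m(p_in):.2f} in / "
        f"${per_1k_to_per_1m(p_out):.2f} out per 1M tokens"
    )
```

So $0.0005/1k reads as $0.50 per 1M, and $0.0015/1k reads as $1.50 per 1M.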
They need to state clearly whether the fine-tune price reductions announced at DevDay apply only to the cheaper -1106 and -0125 models, or whether all fine-tunes have the same inference cost.
Or whether fine-tune cost doesn’t matter at all, given the text right below the June 2024 shutdown notice for the better-performing gpt-3.5-turbo-0613: “Fine-tuned models created from these base models are not effected by this deprecation, but you will no longer be able to create new fine-tuned versions with these models.” (You can still choose -0613 in the UI, but I haven’t run one…)
Or more fine print, about the most accomplished gpt-3.5-turbo-0301 and gpt-4-0314 models:
As of 01/10/2024, only existing users of this model will be able to continue using this model.
This text from the pricing page continues to be incorrect: