gpt-3.5-turbo API pricing

Hi there, I ran a test with 'gpt-3.5-turbo' and made 403 API requests, totaling 142k context/input tokens and 73k generated tokens. The bill says $0.36.

The pricing page says 'gpt-3.5-turbo-0125' costs $0.0005/1k input tokens and $0.0015/1k output tokens. With these numbers I arrive at about $0.16, not $0.36. I only get to $0.36 if I use the numbers for 'gpt-3.5-turbo-instruct' ($0.0015/1k input, $0.002/1k output).
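For anyone who wants to reproduce the arithmetic, here is a quick sanity check of both price points against the rounded token counts above (with 142k/73k taken as exact, the -0125 rates land near $0.18; the precise figure depends on the unrounded counts):

```python
# Rounded token counts from the test run above (assumed exact here).
input_tokens = 142_000
output_tokens = 73_000

# Published per-1k prices as (input, output) pairs.
prices = {
    "gpt-3.5-turbo-0125": (0.0005, 0.0015),
    "gpt-3.5-turbo-instruct": (0.0015, 0.0020),
}

for model, (p_in, p_out) in prices.items():
    cost = input_tokens / 1000 * p_in + output_tokens / 1000 * p_out
    print(f"{model}: ${cost:.2f}")

# gpt-3.5-turbo-0125: $0.18
# gpt-3.5-turbo-instruct: $0.36  <- matches the bill
```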

What really confused me is that I cannot call the OpenAI endpoint with 'gpt-3.5-turbo-0125'; it returns:

ValueError: Unknown model 'gpt-3.5-turbo-0125'. Please provide a valid OpenAI model name in:
[…]
gpt-3.5-turbo
gpt-3.5-turbo-16k
gpt-3.5-turbo-1106
gpt-3.5-turbo-0613
gpt-3.5-turbo-16k-0613
gpt-3.5-turbo-0301
gpt-35-turbo-16k
gpt-35-turbo
gpt-35-turbo-1106
gpt-35-turbo-0613
gpt-35-turbo-16k-0613
[…]

So what is going on here? Is the pricing page outdated, is the backend not up to date with the pricing page, or am I doing something wrong?

Many thanks to anyone who has an idea!


gpt-3.5-turbo is the “full price” version. It is still an alias for gpt-3.5-turbo-0613 and is priced the same as it has always been, which is the same as listed for “instruct”. (Update: gpt-3.5-turbo has since been re-pointed to gpt-3.5-turbo-0125.)

OpenAI has been unclear and cagey about what they show for pricing since DevDay, perhaps intentionally so: they haven’t reduced the price of existing models (except fine-tuning, which was way overpriced); they have just introduced new ones that are cheaper to operate.

In my chart, cost is per 1 million tokens so you can compare the prices more easily:

| Model | Training /1M | Input /1M | Output /1M | Context length |
|---|---|---|---|---|
| gpt-3.5-turbo-0125 | n/a | $0.50 | $1.50 | 16k (4k out) |
| gpt-3.5-turbo-1106 | n/a | $1.00 | $2.00 | 16k (4k out) |
| gpt-3.5-turbo-0613 | n/a | $1.50 | $2.00 | 4k |
| gpt-3.5-turbo-0301 | n/a | $1.50 | $2.00 | 4k |
| gpt-3.5-turbo-16k-0613 | n/a | $3.00 | $4.00 | 16k |
| gpt-3.5-turbo fine-tune (all?) | $8.00 | $3.00 | $6.00 | 4k |
| gpt-4-turbo (all) | n/a | $10.00 | $30.00 | 125k (4k out) |
| gpt-4 | n/a | $30.00 | $60.00 | 8k |
| babbage-002 base | n/a | $0.40 | $0.40 | 16k |
| babbage-002 fine-tune | $0.40 | $1.60 | $1.60 | 16k |
| davinci-002 base | n/a | $2.00 | $2.00 | 16k |
| davinci-002 fine-tune | $6.00 | $12.00 | $12.00 | 16k |

I can’t explain why you’d be denied the newest model unless it is still rolling out; I’ve been plugging away at bugs on it since yesterday. Note that what you show is not an API error: that ValueError is raised client-side by a library that doesn’t know the model name yet. Update the openai and tiktoken libraries to the latest versions that know about the new models.
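If you want to confirm that the endpoint itself accepts the snapshot name, here is a minimal sketch using the current (v1) openai Python library, assuming OPENAI_API_KEY is set in your environment:

```python
# pip install -U openai tiktoken
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment
resp = client.chat.completions.create(
    model="gpt-3.5-turbo-0125",
    messages=[{"role": "user", "content": "Say hello."}],
)
print(resp.model)   # the snapshot that actually served the request
print(resp.usage)   # prompt/completion token counts, i.e. what you get billed for
```

If this succeeds but the same model name fails when routed through a wrapper library, the wrapper’s model list is what needs updating, not the endpoint.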


Thank you very much!

I realized that I imported OpenAI through llama-index, and they probably haven’t updated their library yet.
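For anyone landing here with the same ValueError: it is raised from llama-index’s hard-coded model table, so upgrading that package (not just openai) is what clears it. A rough sketch, assuming the 0.9.x import path (newer releases moved this to llama_index.llms.openai):

```python
# pip install -U llama-index
from llama_index.llms import OpenAI  # 0.9.x import path

# Raises "ValueError: Unknown model ..." on versions whose model table
# predates the -0125 snapshot; works after the upgrade.
llm = OpenAI(model="gpt-3.5-turbo-0125")
print(llm.complete("Say hello."))
```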


It took a while to catch on, but OpenAI also just updated their pricing page so it can be read in units of 1M tokens…

You have to scroll down to find the higher prices of “older models” in their own section.


They need to state clearly whether the fine-tune price reductions announced at DevDay apply only to the cheaper -1106 and -0125 models, and whether all fine-tunes have the same inference cost.

Or whether, right below the June 2024 shutdown notice for the better-performing gpt-3.5-turbo-0613, fine-tune cost doesn’t matter at all, because: “Fine-tuned models created from these base models are not affected by this deprecation, but you will no longer be able to create new fine-tuned versions with these models.” (You can still choose -0613 in the UI, but I haven’t run one…)


Or more fine print, about the most accomplished gpt-3.5-turbo-0301 and gpt-4-0314 models:

As of 01/10/2024, only existing users of this model will be able to continue using this model.


This text from the pricing page continues to be incorrect:

[Screenshot from the pricing page: a text excerpt explaining “tokens” as units for counting words, stating that 1,000 tokens are roughly equivalent to 750 words and that the paragraph shown is 35 tokens long, while the token counter at the top reads “58”.]
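The counts are easy to verify yourself with tiktoken; a minimal sketch (the sample string is a stand-in for the paragraph in the screenshot):

```python
import tiktoken

# The gpt-3.5-turbo family uses the cl100k_base encoding.
enc = tiktoken.encoding_for_model("gpt-3.5-turbo")

text = "You can think of tokens as pieces of words..."  # paste the pricing-page paragraph here
print(len(enc.encode(text)))  # actual token count, e.g. the "58" shown in the counter
```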
