I have been using ‘gpt-3.5-turbo’ as the model in my API calls. Should I always be using one of the specific context models ‘-4k’ or ‘-16k’, depending on my needs? What context size is used if you just set model to ‘gpt-3.5-turbo’?
And when should I use a dated model like ‘gpt-3.5-turbo-0613’?
I want to do more specific tracking of my token pricing.
These are good observations, about details where the documentation is often not exacting and an AI can’t reliably answer.
The standard context length of gpt-3.5-turbo is 4k: 4096 available tokens, which must hold both all the input you send and the response you want the AI to generate. You don’t (and can’t) specify -4k as a suffix.
The bare name gpt-3.5-turbo is an alias that currently points to the recommended snapshot, gpt-3.5-turbo-0613; when you call gpt-3.5-turbo, your reply metadata and usage accounting will show that dated model as the one actually used.
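Since you want finer-grained price tracking, note that each chat completion response already reports the resolved model name and the token usage for that call. Here is a minimal sketch, assuming the openai Python library of the 0.x era; the per-1K prices in the comment are illustrative only, so check the current pricing page rather than relying on them:

```python
import openai

openai.api_key = "sk-..."  # your API key

response = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",  # alias; the reply reports the dated model it resolved to
    messages=[{"role": "user", "content": "Hello!"}],
)

print(response["model"])  # e.g. "gpt-3.5-turbo-0613"
usage = response["usage"]
print(usage["prompt_tokens"], usage["completion_tokens"], usage["total_tokens"])

# Illustrative per-1K-token prices; verify against the pricing page.
PRICE_PER_1K = {"prompt": 0.0015, "completion": 0.002}
cost = (usage["prompt_tokens"] / 1000) * PRICE_PER_1K["prompt"] \
     + (usage["completion_tokens"] / 1000) * PRICE_PER_1K["completion"]
print(f"call cost: ${cost:.6f}")
```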
Employing the -16k model costs exactly twice as much per token for all interactions, even if a particular API call doesn’t need the extra room. So no, you wouldn’t want to use it unless your input or output actually calls for the larger context length. One strategy is to select the model dynamically in software, only for those cases where a larger input is allowed, as sketched below.
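For example, a minimal selection helper might look like this. It is a sketch only: pick_model and the reserve_for_reply default are hypothetical names, and the prompt token count would come from a tokenizer such as tiktoken (shown further below):

```python
def pick_model(prompt_tokens: int, reserve_for_reply: int = 1000) -> str:
    """Choose the cheapest gpt-3.5-turbo variant that fits the request."""
    if prompt_tokens + reserve_for_reply <= 4096:
        return "gpt-3.5-turbo"        # standard 4k context, lower price
    if prompt_tokens + reserve_for_reply <= 16384:
        return "gpt-3.5-turbo-16k"    # larger context at double the per-token price
    raise ValueError("Input too large even for the 16k context model")
```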
Token tracking and counting within your own software can be done with a library such as tiktoken, which encodes text using the model’s actual token dictionary. That enables strategies like measuring exactly how large the past turns of an old chat are, so you can decide what must be discarded, as in the sketch below.
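Here is a minimal sketch of that approach with tiktoken. The per-message overhead of roughly 3 tokens follows OpenAI’s published counting guidance for the -0613 chat models and may differ for other models; trim_history and its default budget are hypothetical names chosen for illustration:

```python
import tiktoken

encoding = tiktoken.encoding_for_model("gpt-3.5-turbo")

def message_tokens(message: dict) -> int:
    """Approximate tokens a single chat message consumes in the context."""
    tokens = 3  # per-message framing overhead (role/content markers)
    for value in message.values():
        tokens += len(encoding.encode(value))
    return tokens

def trim_history(messages: list[dict], budget: int = 3000) -> list[dict]:
    """Keep the most recent messages that fit within the token budget."""
    kept, total = [], 0
    for msg in reversed(messages):      # walk newest to oldest
        size = message_tokens(msg)
        if total + size > budget:
            break                       # everything older gets discarded
        kept.append(msg)
        total += size
    return list(reversed(kept))         # restore chronological order
```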
General token usage can also be seen on the usage page of your OpenAI account, summarized at 5-minute granularity in the daily view.
Hopefully this informs the bigger question you have in mind!