I have some questions regarding how billing is done with GPT-4-Turbo using the Chat.Completion

felipe.cesar · December 22, 2023, 2:10pm

I built a bot that has a system prompt of about 2000 tokens and some functions that have 450 tokens. Will every call I make to the API using Chat.Completion charge for all these tokens? If the conversation progresses a lot and I need to pass a larger conversation history for ChatGPT to have the context to respond, does it charge for the tokens cumulatively? I’m noticing that in a conversation that evolves and totals 2500 tokens, each call is charging the cumulative cost of the prompt + history + new messages. Is my understanding correct? Is there any way to reduce this cost?

TonyAIChamp · December 22, 2023, 2:20pm

Welcome to the forum, Felice!

Chat models are stateless, so each time you do request a completion, you send all of the instructions and the context you want AI to follow (generally it is system prompt and some form of chat history).

felipe.cesar · December 22, 2023, 2:34pm

So, will it always charge for the entire history, system prompt, and user prompt?

TonyAIChamp · December 22, 2023, 2:56pm

It charges for everything you send it and everything you get as a result.

_j · December 22, 2023, 3:17pm

…and therefore, you would want to employ a token-counting method in your local database of conversation history between AI and user, and use intelligent decision-making in your software.

You then can not just prevent errors of going over the context length, but also can prevent $1 per question bills by limiting the memory of a conversation to a particular number of turns or maximum input tokens.

yu.shlegel · December 22, 2023, 9:08pm

I can also recommend using the gpt-3.5-turbo model, it is 10 times cheaper and in some solutions it does even better)

felipe.cesar · December 26, 2023, 2:24pm

Thanks, i`ll try with gpt-3.5 too.

felipe.cesar · December 26, 2023, 2:25pm

Thanks!! this will help me improve my solution!

Topic		Replies	Views
How to optimize API request in terms of expenses API	8	1965	December 17, 2023
Pricing, Billing and Tokens? Math is not adding up API api	9	2277	February 16, 2024
Retain past responses in memory without sending them again at every API request API gpt-4 , gpt-35-turbo , chatgpt	11	9805	January 25, 2024
How to improvement my app to use less tokens Community gpt-4 , api	4	7250	July 8, 2024
Help on pricing chat bot? API gpt-4	16	1120	February 16, 2024

I have some questions regarding how billing is done with GPT-4-Turbo using the Chat.Completion

Related topics