How can we count the used tokens in a conversation?

gabegabe · May 16, 2023, 8:17pm

Based on the documentation for a proper conversation we have to send a list of messages repeatedly. I would like to know how we can count the used token in this case for cost estimation.

Let’s say my system message/prompt is 1k tokens.
The 1st user message is 1k tokens
The response is 1k tokens as well from the bot

Then this means we’ve used 3k tokens with the first message-response pair.

For the 2nd user message, we need to send the entire previous 3k tokens and also the new message (which is 1k tokens again) and the response is 1k tokens as well.

Now we spent 3k + 2k = 5k tokens for the 2nd message pair.

The whole session was costing us 3K + 5k tokens, am I right?
In real life as the maximum input is 4096 tokens, we would get an error for the 2nd message, am I right?

Can you give me the formula for the calculation?

sps · May 16, 2023, 9:30pm

Welcome to the community @gabegabe

Here’s the notebook for using tiktoken to count tokens for chat API call

jwatte · May 17, 2023, 12:15am

Yes, that is correct.

Tricks that you can use include using the model to summarize longer requests and responses, so you can send less, and sometimes just sending previous user questions without the bot answers will be good enough.

Topic		Replies	Views
How many tokens is normal usage for asking a question? API chatgpt	7	16097	September 6, 2024
Estimating GPT API Conversation Costs: Factoring in Cumulative Input and Output Tokens API api , token	2	1688	October 2, 2023
Tokens usage on Response API with previous message API gpt-4 , responses-api	2	79	July 28, 2025
Counting tokens for chat API calls (gpt-3.5-turbo) Documentation	5	27954	December 13, 2023
How to count tokens from code interpreter usage? API	3	2896	January 16, 2024

How can we count the used tokens in a conversation?

Related topics