How to get total_tokens from a stream of CompletionCreateRequests


To see how many tokens are used by an API call, check the usage field in the API response (e.g., response['usage']['total_tokens']).
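For the non-streaming case, this is roughly where the numbers live. A minimal sketch, using a plain dict as a stand-in for an actual API response object (the field names follow the chat completions response format; the values here are made up):

```python
# Stand-in for a non-streaming chat completion response.
response = {
    "choices": [{"message": {"role": "assistant", "content": "Hello!"}}],
    "usage": {
        "prompt_tokens": 9,
        "completion_tokens": 12,
        "total_tokens": 21,
    },
}

usage = response["usage"]
print(usage["total_tokens"])       # 21
print(usage["prompt_tokens"])      # 9
print(usage["completion_tokens"])  # 12
```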

response['usage'] is null

How do I get total_tokens, prompt_tokens, and completion_tokens?


Each response in the stream is one token. I have found that you also need to add one extra token as overhead.

There is no easy way to get the token count for the prompt part. You will have to use a tokenizer library. There are also overhead tokens for the start and end of each user, system, and assistant record.

Streams don’t pass the token count.
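The chunk-counting idea above can be sketched like this. This is a rough heuristic, not an exact count; the list below is a stand-in for a real SSE stream of delta chunks, and the one-token overhead follows the observation earlier in this thread:

```python
def estimate_completion_tokens(stream, overhead=1):
    """Rough estimate: one token per streamed chunk, plus a small
    fixed overhead. `stream` is any iterable of chunks."""
    return sum(1 for _ in stream) + overhead

# Stand-in for a real stream: each element represents one delta chunk.
fake_stream = ["Hel", "lo", ",", " wor", "ld", "!"]
print(estimate_completion_tokens(fake_stream))  # 7
```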


Hi Raymond, are you sure that each chunk is one token? Even if that's true, it only gives the completion_tokens, not the total tokens.

Just want to confirm it. I am looking for a way to work out the cost. The documentation mentions that tokens are sent via SSE, but from Python I do not know the endpoint.

I just double-checked with my code, with stream and non-stream.

I can confirm that each chunk is roughly one token, though not exactly. The answers from stream and non-stream are not the same, but close, and their token counts (260 and 293) are also close.

This only provides the completion_tokens.

Anyone know how to find the prompt_tokens?

The only way I have been able to get a token count for the prompt is by using a tokenizer in my code.

There also appears to be an overhead of a few tokens for each role record.

In the end, the difference is small when you consider the cost is per thousand tokens and not per token.

When you also consider that the cost is a fraction of a cent per thousand tokens, the difference hardly matters at all.

Agree. In the end I used the length of my prompt string divided by 4 to estimate the number of prompt tokens, and used your way of counting chunks to estimate the completion tokens. Thanks!
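Putting both heuristics from this thread together, a minimal sketch (the ~4 characters per token and one-token-per-chunk-plus-overhead assumptions come from the posts above):

```python
def rough_token_estimate(prompt, stream_chunks, overhead=1):
    """Heuristic from this thread: ~4 characters per prompt token,
    one completion token per streamed chunk plus a small overhead."""
    prompt_tokens = len(prompt) // 4
    completion_tokens = len(stream_chunks) + overhead
    return prompt_tokens, completion_tokens, prompt_tokens + completion_tokens

p, c, t = rough_token_estimate("What is the capital of France?", ["Par", "is", "."])
print(p, c, t)  # 7 4 11
```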
