Chat completion "stream" API token usage

Hello.

I’m currently using the chat completion “stream” API and need information on token usage.

I’m wondering if usage data for this stream API is planned to be provided, and if so, when it will be available.

There was a guide as below, and it works fine for English, but the numbers of events and tokens differ for other languages (such as Korean).

I’m wondering if there is a guide to token calculation for other languages. (Is there a more memory-efficient way than collecting the whole response and calculating with tiktoken?)

Streaming the response does not produce a token-usage count. The stream only sends a “finish_reason” in its final delta.

There are no indications that the streaming method will change in the future. A change could break almost every piece of software that relies on the current behavior.

Token counting works the same way regardless of which language you use to talk to the AI. You can append the whole response into an accumulated string at the same time as you display it. Then, when finished, calculate the number of tokens in the complete response using the tiktoken library or similar.

Software manual: https://github.com/openai/openai-cookbook/blob/main/examples/How_to_count_tokens_with_tiktoken.ipynb

The forum post you linked is wrong. There is no guarantee that the number of stream pieces you receive corresponds to the number of tokens they contain.

OpenAI team here: we have now added a feature for this! See Usage stats now available when using streaming with the Chat Completions API or Completions API.
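A sketch of how a client might consume this, assuming the announced `stream_options={"include_usage": True}` behavior, where a final chunk with an empty `choices` list carries a `usage` object. The `Chunk` and `Usage` dataclasses below are simplified stand-ins for the SDK's types so the handling logic can run without a live API call; in real code the text lives in `chunk.choices[0].delta.content`.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

@dataclass
class Usage:
    # stand-in for the SDK's usage object on the final chunk
    prompt_tokens: int
    completion_tokens: int

@dataclass
class Chunk:
    # stand-in for one streamed chunk: either text content or usage
    content: Optional[str] = None
    usage: Optional[Usage] = None

def consume_stream(chunks: List[Chunk]) -> Tuple[str, Optional[Usage]]:
    """Collect the streamed text and the usage reported by the final chunk."""
    pieces, usage = [], None
    for chunk in chunks:
        if chunk.content:
            pieces.append(chunk.content)
        if chunk.usage is not None:   # only the last chunk carries usage
            usage = chunk.usage
    return "".join(pieces), usage

# Simulated stream: two content deltas, then the trailing usage chunk.
text, usage = consume_stream([
    Chunk(content="Hello"),
    Chunk(content=" world"),
    Chunk(usage=Usage(prompt_tokens=9, completion_tokens=2)),
])
print(text, "->", usage.completion_tokens, "completion tokens")
```

With this option enabled, the server reports exact token counts, so the tiktoken accumulation workaround above is no longer needed for streaming.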
