OpenAI API - get usage tokens in the response when stream=True is set

In general, we can get token usage from response.usage.total_tokens, but not when I set the parameter stream to True. For example:

import openai

def performRequestWithStreaming():
    openai.api_key = OPEN_AI_TOKEN
    response = openai.ChatCompletion.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": "What is Python?"}],
        stream=True,
        temperature=0)

    for r in response:
        print(r)

all the responses look like this:

{
  "choices": [
    {
      "delta": {
        "content": "."
      },
      "finish_reason": null,
      "index": 0
    }
  ],
  "created": 1680676704,
  "id": "chatcmpl-71r4iJF8s8R7Uedb4FZO13U5CPdTr",
  "model": "gpt-3.5-turbo-0301",
  "object": "chat.completion.chunk"
}
{
  "choices": [
    {
      "delta": {},
      "finish_reason": "stop",
      "index": 0
    }
  ],
  "created": 1680676704,
  "id": "chatcmpl-71r4iJF8s8R7Uedb4FZO13U5CPdTr",
  "model": "gpt-3.5-turbo-0301",
  "object": "chat.completion.chunk"
}

There is no usage property now, so how can I know how many tokens were used?

14 Likes

I agree; I would like to have the usage in the response to stream requests (at least in one of the events in the stream), similar to the usage for non-stream requests.

However, here is a workaround in the meantime:

  1. The number of prompt tokens can be calculated offline, using tiktoken in Python for example (This is a guide I used).
  2. The number of events in the stream should represent the number of tokens in the response, so just count them while iterating.
  3. Adding 1 and 2 together gives you the total number of tokens for the request.

I want to stress that I would still like to receive the usage in the response to the request and not have to compute it myself.
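The counting in steps 2 and 3 can be sketched like this (step 1, the prompt estimate, would use tiktoken offline; note that nothing guarantees each stream event is exactly one token, so this is an approximation):

```python
# Step 2 from the workaround above: count the completion tokens by counting
# the content-bearing stream events. The final finish_reason chunk has an
# empty delta and is skipped.

def count_completion_chunks(response):
    """Count delta events that actually carry content."""
    n = 0
    for chunk in response:
        if chunk["choices"][0]["delta"].get("content"):
            n += 1
    return n

# Simulated stream, shaped like the chunks shown earlier in the thread:
fake_stream = [
    {"choices": [{"delta": {"content": "Python"}, "finish_reason": None}]},
    {"choices": [{"delta": {"content": " is"}, "finish_reason": None}]},
    {"choices": [{"delta": {}, "finish_reason": "stop"}]},
]
print(count_completion_chunks(fake_stream))  # → 2
```

For step 3, you would add this count to the offline tiktoken estimate of the prompt tokens.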

3 Likes

I noticed that on the usage page you can see the number of requests and the token usage per period, so is there any official API that can query the token usage of a conversation by its “id”? The “id” exists in both stream requests and normal requests. (“id”: “chatcmpl-74pW6*********************Wdi”)

1 Like

However, after extensive testing, I found that the token count produced by offline calculation is far from the actual value used, so Python’s tiktoken is not reliable for this.
That said, here is the method I am currently using to calculate tokens:

  1. Each stream event containing part of the answer is treated as one token, and adding all these events up gives the total tokens in the answer. This is how I calculate the response tokens.
  2. The prompt tokens can be estimated with tiktoken (which, as I said, is not really exact). Ha ha ha
1 Like

Can you clarify that? Question and answer? Are you saying that the chunks returned in the answer (response) account for both the question (request) and the answer (response)? So the entire conversation turn’s token usage (question and answer) is basically the number of chunks returned in the stream?

1 Like

I think that’s what I meant. I don’t quite understand English; I am Chinese and use translation software to translate your language, so there may be some discrepancies in the translation. Sorry.

1 Like

FWIW: an ideal place for it to be picked up would be when we receive the [DONE] message. I would like this too; it would be nice to avoid using an instance of tiktoken to do all of this.
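For reference, in the raw SSE stream the terminator is a literal `data: [DONE]` line; a hypothetical parser (a sketch, not the official client) might watch for it like this:

```python
import json

def parse_sse_events(raw_lines):
    """Yield parsed chunk dicts from raw SSE 'data:' lines, stopping at the
    [DONE] sentinel, which is where a usage object would ideally arrive."""
    for line in raw_lines:
        if not line.startswith("data: "):
            continue
        payload = line[len("data: "):]
        if payload == "[DONE]":
            return  # end of stream
        yield json.loads(payload)

# Simulated raw SSE lines:
raw = [
    'data: {"choices": [{"delta": {"content": "Hi"}}]}',
    'data: [DONE]',
    'data: {"choices": [{"delta": {"content": "ignored"}}]}',
]
chunks = list(parse_sse_events(raw))
print(len(chunks))  # → 1
```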

2 Likes

I’m not using stream=true currently, but isn’t each chunk a single token? So you could count the # of chunks and have the # of tokens?

1 Like

That’s just the reply; the # of tokens consumed includes what you send, plus parts of the data structure that makes up the messages array.

This package has a rubric for figuring it out using a node implementation of tiktoken:

Still, would be nice to hear from the source what the total consumed was for the individual request.
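To illustrate the point about message-array overhead, here is a rough Python sketch. The 4-tokens-per-message and +3 reply-priming figures are the ones OpenAI’s cookbook documented for gpt-3.5-turbo-0301 and are assumptions that vary by model; the word-splitting tokenizer here is only a stand-in for tiktoken:

```python
# Why prompt tokens exceed the visible text: every message in the messages
# array carries framing overhead on top of its role and content strings.

def estimate_prompt_tokens(messages, count_text_tokens,
                           tokens_per_message=4, reply_priming=3):
    """count_text_tokens: any tokenizer callable; in real use you would pass
    something like lambda t: len(enc.encode(t)) with a tiktoken encoding."""
    total = reply_priming                 # the reply is primed with tokens too
    for m in messages:
        total += tokens_per_message       # per-message framing overhead
        for value in m.values():          # role and content both count
            total += count_text_tokens(value)
    return total

# Demo with a stand-in tokenizer (1 token per whitespace-separated word):
naive = lambda text: len(text.split())
msgs = [{"role": "user", "content": "What is Python?"}]
print(estimate_prompt_tokens(msgs, naive))  # 3 (priming) + 4 + 1 + 3 = 11
```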

1 Like

You can only count completion tokens this way, not prompt tokens.

Looking up usage by ID would be good, or a tokenizer endpoint would be good as well.

2 Likes

Uh, has anyone noticed that in the official API docs, they show a usage field in the response chunk:

  "usage": {
    "prompt_tokens": 9,
    "completion_tokens": 12,
    "total_tokens": 21
  }

However, I am quite sure I didn’t get one, even in the final “stop” chunk. Can anyone help?

4 Likes

I just checked all the chunks I get back and you’re right, none of them have the usage field, even though the documentation shows it. Weird.

That’s not in the chunk object or streaming example:

This can’t be right.

So we are purchasing tokens for a price per token, but we’re not allowed to know how many tokens a request has used?

This must be a bug.

5 Likes

Bumping this thread, as this is a major hole in the current API. Specifically, streaming responses should include a usage object, either as a cumulative sum or alternatively alongside the final "finish_reason": "stop" chunk.

Counting the number of chunks returned is not a valid workaround because (a) we have no explicit guarantee that each chunk is exactly one token, and (b) it can’t account for the prompt_tokens used in the completion request, even though we are billed for them.

5 Likes

Well, you can just run tiktoken on each delta chunk and sum the results.

1 Like

Since it’s not even stated that chunks will always fall on token boundaries:

import tiktoken

class Tokenizer:
    def __init__(self, encoder="cl100k_base"):
        self.tokenizer = tiktoken.get_encoding(encoder)

    def tokens(self, text):  
        return len(self.tokenizer.encode(text))

count = Tokenizer()

# assemble AI `reply` as you would need to do to add to chat history
tokens = count.tokens(reply)

A clever person could even calculate a function_call return by putting it back into the language the AI emitted.

2 Likes

Sure, that’s a “workaround”, but not a solution.

It (a) assumes tiktoken perfectly matches the tokenization the API uses, and (b) forces developers to add another dependency to their project, particularly when the only official version is the Python package (JS users have to rely on a third-party fork of their choice, which can be problematic for a few reasons).

I’d link to issues 22 and 97 on the github repo but don’t have the rep to add links…

It works when we use

result = await api.ChatEndpoint.GetCompletionAsync(chatRequest);

Unfortunately, when we stream, Usage is null:

result = await api.ChatEndpoint.StreamCompletionAsync(chatRequest, partialResponse =>
{
    txtinfo = txtinfo + partialResponse.FirstChoice.Delta;
});

I have a similar problem: either show the word “Processing” for at least 20s and keep track of usage, or have a responsive application without any clue about costs. I think the Usage info on the official website has some delay.