Cool stuff.
Question:
Is there a reason you did it like this, as opposed to including prompt tokens in (or before) the first chunk and completion tokens with each chunk?
The way you did it can’t be used if a generation is canceled by the client (which is quite common when intercepting or preempting model errors).
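
To make the suggestion concrete, here's a minimal sketch of the alternative scheme: prompt-token usage arrives with the first chunk, and a cumulative completion-token count rides along with every chunk. All names here (`stream_completion`, `Chunk`, `Usage`) are hypothetical, not your implementation or any real API.

```python
from dataclasses import dataclass
from typing import Iterator, Optional

@dataclass
class Usage:
    prompt_tokens: int
    completion_tokens: int  # cumulative count so far, not just this chunk

@dataclass
class Chunk:
    text: str
    usage: Usage  # present on every chunk, not only the final one

def stream_completion(prompt_tokens: int, pieces: list[str]) -> Iterator[Chunk]:
    """Yield chunks that each carry up-to-date usage figures."""
    completed = 0
    for piece in pieces:
        completed += 1  # assume one token per piece, for illustration
        yield Chunk(text=piece, usage=Usage(prompt_tokens, completed))

# Because usage is cumulative on every chunk, a client that cancels
# mid-stream still has accurate counts from the last chunk it saw:
last_usage: Optional[Usage] = None
for chunk in stream_completion(prompt_tokens=12, pieces=["Hel", "lo", "!"]):
    last_usage = chunk.usage
    if chunk.text == "lo":  # client-side cancellation, e.g. on a model error
        break
print(last_usage)  # Usage(prompt_tokens=12, completion_tokens=2)
```

With usage only in a final chunk, the `break` above would leave the client with no token counts at all.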