GET completion/{id} endpoint for usage backfill

We currently use the completions API with streaming, which means the response body does not include token usage. It would be great to call GET completion/{id} or something similar to capture token usage and backfill it into our database. It seems that the OpenAI platform “usage” page’s frontend already makes such a GET call:


Yes, that’s a decent idea, but it also adds latency and network overhead that are completely unnecessary, and it would delay the decision-making for the next round of a chat calculation.

The “call” can be to your own token counter in your software. You know how much you send, and you know how much you receive. You can even count the tokens before you send, in order to make decisions about how much other context you can include or how big a response to allow.
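A minimal sketch of this local counting, assuming the `tiktoken` library is available (with a rough word-count fallback if it is not). The prompt is counted before sending; the completion is counted by accumulating the streamed deltas and encoding the joined text once at the end:

```python
def count_tokens(text: str, model: str = "gpt-3.5-turbo") -> int:
    """Count tokens locally instead of fetching usage from the API."""
    try:
        import tiktoken
        enc = tiktoken.encoding_for_model(model)
        return len(enc.encode(text))
    except Exception:
        # Rough fallback if tiktoken is unavailable:
        # ~1 token per whitespace-separated word is a loose lower bound.
        return len(text.split())


prompt = "Summarize the plot of Hamlet in one sentence."
prompt_tokens = count_tokens(prompt)  # known before the request is sent

# During streaming, append each delta to a list, then count once at the end.
# The chunks below are a stand-in for the streamed deltas.
chunks = ["Hamlet ", "avenges ", "his ", "father."]
completion_tokens = count_tokens("".join(chunks))
```

Both numbers can then be written straight to your database alongside the completion, with no extra round trip to the API.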

The only thing you can’t determine from the response is the price of a failure.