I’m currently using the OpenAI Agent SDK with an Azure-hosted reasoning model (o1). The setup is working well, and I’m successfully receiving responses from the Agent via chat completions.
In the response object, I can see the `usage` node, which includes both `output_tokens` and `reasoning_tokens`.
I have a couple of questions about this:
- Is the `output_tokens` count inclusive of `reasoning_tokens`? I’m trying to determine the actual token usage cost, and understanding this breakdown will help.
- Is there any way to configure or enforce the model not to emit `reasoning_tokens` in the Agent’s response?
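For reference, here is the arithmetic I’m assuming when I try to split the cost — just a sketch over a plain dict that mirrors the `usage` node I see; the field names and the sample numbers are illustrative, and whether `output_tokens` really includes `reasoning_tokens` is exactly what I’m asking:

```python
# Hypothetical `usage` payload mirroring what I see in the response.
# Values are made up for illustration.
usage = {
    "input_tokens": 50,
    "output_tokens": 300,
    "output_tokens_details": {"reasoning_tokens": 220},
}

reasoning = usage["output_tokens_details"]["reasoning_tokens"]

# If output_tokens is inclusive of reasoning_tokens, the visible
# (user-facing) portion of the completion would be the remainder:
visible = usage["output_tokens"] - reasoning

print(f"reasoning={reasoning}, visible={visible}")
```

If the counts are instead disjoint, the billed total would be `output_tokens + reasoning_tokens`, which is why I’d like to confirm the semantics.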
Any insights, documentation pointers, or best practices would be highly appreciated. Thanks in advance!