Limiting maximum number of reasoning tokens

Yes, you can limit it with the `max_output_tokens` parameter (reasoning tokens count as part of the output).

Controlling costs

If you’re managing context manually across model turns, you can discard older reasoning items unless you’re responding to a function call, in which case you must include all reasoning items between the function call and the last user message.
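The pruning rule above can be sketched as a small helper. This is a minimal illustration, not SDK code: the item dicts and their `"type"`/`"role"` fields are a simplified stand-in for whatever item shape your SDK returns.

```python
def prune_reasoning(items, responding_to_function_call):
    """Drop older reasoning items from a manually managed context.

    `items` is a simplified list of dicts with a "type" field
    ("message", "reasoning", "function_call"); the real item shape
    depends on your SDK.
    """
    # Index of the last user message (-1 if there is none).
    last_user = max(
        (i for i, it in enumerate(items)
         if it["type"] == "message" and it.get("role") == "user"),
        default=-1,
    )
    kept = []
    for i, it in enumerate(items):
        if it["type"] == "reasoning":
            # Keep reasoning only when replying to a function call,
            # and only the items after the last user message.
            if responding_to_function_call and i > last_user:
                kept.append(it)
        else:
            kept.append(it)
    return kept
```

Run on a turn that ends in a function call, this keeps only the reasoning produced since the last user message; with `responding_to_function_call=False`, all reasoning items are discarded.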

To manage costs with reasoning models, you can limit the total number of tokens the model generates (including both reasoning and final output tokens) by using the max_output_tokens parameter.
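As a minimal sketch, the cap is just one field on the request. The model name and budget below are illustrative, not recommendations:

```python
# Cap total generated tokens (reasoning + visible output) with
# max_output_tokens; model and budget are illustrative values.
payload = {
    "model": "o4-mini",
    "input": "Summarize the attached report.",
    "max_output_tokens": 1024,  # reasoning tokens count toward this cap
}

# With the official Python SDK this maps onto:
#   client.responses.create(**payload)
```

If the budget is exhausted mid-reasoning, the response can come back incomplete, so leave enough headroom for the final answer.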
