Hi, Atty from OpenAI here — max_tokens continues to be supported in all existing models, but the o1 series only supports max_completion_tokens.
We are doing this because max_tokens previously meant two things at once: the number of tokens we generated (and billed you for) and the number of tokens you got back in your response. With the o1 models this is no longer true; we generate more tokens than we return, because reasoning tokens are not visible. Some clients may have depended on the previous behavior and written code that assumes max_tokens equals usage.completion_tokens, or the number of tokens they received. To avoid breaking those clients, we require you to opt in to the new behavior via a new parameter.
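To make the distinction concrete, here's a minimal sketch of how a client might pick the right parameter per model. The prefix check on the model name is an assumption for illustration, not an official compatibility rule:

```python
def token_limit_param(model: str, limit: int) -> dict:
    """Return the request field that caps generation for a given model.

    Assumption: o1-series models are identified by an "o1" name prefix.
    For them, max_completion_tokens caps ALL generated tokens (visible
    output plus hidden reasoning tokens); for other models, max_tokens
    behaves as before.
    """
    if model.startswith("o1"):
        return {"max_completion_tokens": limit}
    return {"max_tokens": limit}


# These dicts would be merged into the chat completion request body.
print(token_limit_param("o1-mini", 1000))   # {'max_completion_tokens': 1000}
print(token_limit_param("gpt-4o", 1000))    # {'max_tokens': 1000}
```

Note that with an o1 model, usage.completion_tokens can exceed the number of tokens you actually see in the response, so don't treat the two as equal.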
More documentation here: https://platform.openai.com/docs/guides/reasoning/controlling-costs