Default value of `max_output_tokens` of `responses.create` for `gpt-4.1-2025-04-14`

Hello,

I would like to know the default value of `max_output_tokens` in `responses.create` for `gpt-4.1-2025-04-14`.

If we don’t set this value, does it default to the model’s maximum output capacity?

Could anyone help?

Thank you

Timur

PS: https://platform.openai.com/docs/api-reference/responses/create

I believe it’s 2048. You can test this by instructing the model to repeat a sequence indefinitely, then checking the token consumption (or running the output through the tokenizer).

I don’t see how to test this reliably. Does the model always tend to produce a response close to `max_output_tokens`?

In general, it does tend to land close to `max_output_tokens` when specified (sometimes above, sometimes below). As for the default value, I don’t think it’s mentioned anywhere in the docs or visible in the source code.

To build upon @OnceAndTwice’s idea: find any piece of text that is close to 32K tokens, send it as part of the input via the API, and prompt the model to repeat it verbatim. For reference, the maximum output length for GPT-4.1 is 32,768 tokens. If the model returns the entire text, one can assume that the default `max_output_tokens` is the model’s maximum capacity.
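A rough sketch of that probe, for anyone who wants to try it. Note the assumptions: the ~4-characters-per-token ratio is only a heuristic (use `tiktoken` for an exact count), and `build_long_text` is a hypothetical helper, not anything from the SDK. The guarded block at the end shows how you might call `responses.create` without setting `max_output_tokens` and then inspect `usage.output_tokens` to see where the default cap kicks in:

```python
import os

# Rough heuristic: ~4 characters per token for English text. This is an
# assumption for sizing the input only, not an exact tokenizer count.
CHARS_PER_TOKEN = 4
TARGET_TOKENS = 32_000  # just under GPT-4.1's documented 32,768-token output cap

def build_long_text(target_tokens: int) -> str:
    """Build a filler passage roughly `target_tokens` tokens long (hypothetical helper)."""
    sentence = "The quick brown fox jumps over the lazy dog. "
    repeats = (target_tokens * CHARS_PER_TOKEN) // len(sentence) + 1
    return (sentence * repeats)[: target_tokens * CHARS_PER_TOKEN]

long_text = build_long_text(TARGET_TOKENS)
print(f"Built {len(long_text)} characters (~{len(long_text) // CHARS_PER_TOKEN} tokens)")

# The actual probe needs an API key, so it is guarded here; this is a sketch,
# not a definitive measurement script.
if os.environ.get("OPENAI_API_KEY"):
    from openai import OpenAI  # pip install openai

    client = OpenAI()
    response = client.responses.create(
        model="gpt-4.1-2025-04-14",
        input=f"Repeat the following text verbatim:\n\n{long_text}",
        # max_output_tokens deliberately omitted to observe the default behavior
    )
    # If output_tokens lands near 32,768, the default is plausibly the model's
    # maximum; a much earlier cutoff would reveal a smaller default cap.
    print(response.usage.output_tokens)
```

If the response is truncated well short of the full text, checking `response.incomplete_details` should also indicate whether the cutoff was due to the output-token limit.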