Limit of output tokens in API for web search AI models

Hello folks!

I have a quick question please.

The web search tool is currently limited to the GPT-4o and GPT-4.1 models. GPT-4.1, for example, caps output at 32,768 tokens despite its 1,047,576-token context window. That output cap feels restrictive for web search use cases.

Given the size of the context window, is there any plan to increase the max output token limit for these web search models?


That’s a fair ask. Was there a particular report you were trying to generate, or a use case you have in mind that would leverage a higher max output token limit?

For what it’s worth, the o3 and o4-mini models, with 100,000 max output tokens, are industry-leading at the moment in terms of output limit, and I’d say equally suitable for web search given their 200,000-token context windows.
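If you do switch between models, it’s worth clamping your requested `max_output_tokens` to the per-model cap so the API doesn’t reject the request. Here’s a minimal sketch using the figures quoted in this thread (assumption: these limits are current as of this discussion and may change on OpenAI’s side):

```python
# Per-model max output token limits, as quoted in this thread.
# These figures are assumptions and may change over time.
MAX_OUTPUT_TOKENS = {
    "gpt-4.1": 32_768,   # quoted cap for the web-search-capable GPT-4.1
    "o3": 100_000,       # quoted cap for o3
    "o4-mini": 100_000,  # quoted cap for o4-mini
}

def clamp_output_budget(model: str, requested: int) -> int:
    """Return a max_output_tokens value the given model will accept."""
    limit = MAX_OUTPUT_TOKENS.get(model)
    if limit is None:
        raise ValueError(f"unknown model: {model}")
    return min(requested, limit)

print(clamp_output_budget("gpt-4.1", 50_000))  # → 32768 (clamped)
print(clamp_output_budget("o3", 50_000))       # → 50000 (within limit)
```

Then pass the clamped value as `max_output_tokens` in your request, rather than hard-coding one number across models.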
