Max_tool_calls being ignored

Benedict_Summers · November 24, 2025, 5:40pm

I am using the responses API to hit the deep research models (both o4 and o3).

It seems to totally ignore what I set for max_tool_calls, often having 100+ “response.web_search_call.completed” in the stream output.

These docs https://platform.openai.com/docs/guides/deep-research state that this should be respected, and is the main way of controlling costs. Am I missing something?

Thanks!

MODEL = “o4-mini-deep-research”

stream = client.responses.create(
model=MODEL,
input=[
{“role”: “developer”, “content”: [{“type”: “input_text”, “text”: “You are a research assistant. Cite sources.”}]},
{“role”: “user”, “content”: [{“type”: “input_text”, “text”: prompt}]}
],
tools=[{“type”: “web_search”}],
max_tool_calls=5,
background=True,
stream=True,
store=True # Required for background mode
)

https://platform.openai.com/docs/guides/deep-research

Topic		Replies	Views
Gpt-5 calls web_seach_preview tool 10x more than before after 5th of September API	3	358	September 19, 2025
Openai web search token limit issue Bugs	4	458	March 25, 2025
Does OpenAI charge you for failed (timeout error) requests? API	1	1495	April 18, 2023
O3-deep-research - 1 million tokens spent .. no output :( API deep-research	37	2107	November 27, 2025
File search disregards max num results Bugs gpt-4 , api , assistants-api	4	702	April 18, 2025

Max_tool_calls being ignored

Related topics