I believe the issue here arises because OpenAI injects their own message regarding how to reproduce from news. The AI sees this (and not you). It inspires the AI to write:
One has to co-opt within the internal iterations for such revelation.
Then if clearly provided more instructions, such as your deliberate “search web, then search files”, there is an iterator in Responses, and more internal unobservable calls can occur to more tools, resulting in more output. The list of outputs is clearly designed to handle such scenarios, where the AI could even be instructed to, without intervention, call and report on multiple queries automatically.
Your own function-call loop that you can receive both “content” and “tool_call”, show and act on them, and then upon return the AI can issue more tool calls and is not barred from producing user-seen content also? Responses has that built in.
If you don’t want AI running out of your control, by language you didn’t produce, you will have to use your own API service for web search and RAG.
