Hi all — posting a reproducible bug on the Responses API in case anyone else is hitting it, and to get more eyes on it.
Summary
POST /v1/responses returns HTTP 500 server_error when all three of the following are present:
-
model: "gpt-4.1-mini" -
tools: [{ "type": "web_search" }] -
A specific class of input prompt (see below)
Changing any single one of those variables makes the request succeed. The failure is deterministic across multiple sessions over the past several days.
Minimal repro
Fails (HTTP 500, ~7–8s TTFB):
curl https://api.openai.com/v1/responses \
-H "Authorization: Bearer $OPENAI_KEY" \
-H "Content-Type: application/json" \
--data '{
"model": "gpt-4.1-mini",
"input": "What are the best AI-powered collaboration workspaces for teams in 2026?",
"max_output_tokens": 2048,
"tools": [{ "type": "web_search" }]
}'
Response:
{
"error": {
"message": "An error occurred while processing your request... Please include the request ID req_XXXX in your message.",
"type": "server_error",
"code": "server_error"
}
}
Controls (all succeed)
| Change from failing request | Result |
|---|---|
Remove tools (keep model + prompt) |
HTTP 200, ~16s |
Keep tools, change prompt to "What is the capital of France?" |
HTTP 200, ~1.3s |
Keep tools + prompt, change model to gpt-5-nano |
HTTP 200, ~59s |
So: the bug requires gpt-4.1-mini × web_search × this particular input together. Each variable in isolation works fine.
Failing request IDs
Five from two separate sessions:
-
req_2511bd1f82f04ff88ec63e1385705044 -
req_071d4913a42241f7a00cadef346b75b5 -
req_8196a931b7944e0590056742ff58ad2f -
req_2aec8b9613324d83923773068d93f8ae -
req_66c902aa815849fb866fd6691c048505
Observation that may help triage
Both failing requests return in 7–8 seconds. The matched gpt-5-nano control with the same prompt and web_search enabled runs for ~59 seconds before completing successfully. That gap suggests gpt-4.1-mini is bailing out early during tool orchestration rather than timing out on a long-running search — feels backend-side, not search-side.
What I’ve ruled out
-
Not an auth or rate-limit issue (other requests on the same key/org succeed immediately before and after).
-
Not network — direct curl to
api.openai.com, no proxies, no SDKs, full HTTP trace captured. -
Not specific to one input — variations of “best X in 2026” style prompts with
gpt-4.1-mini+web_searchshow the same behavior. -
Reproducible across days, different sessions, fresh API keys.
Filed via support a week ago (case 08421197) but the response loop has been asking for HAR files, batch IDs, conversation URLs, and most recently questions about Codex — none of which apply to a direct curl call against the Responses API. Posting here in the hope of getting it in front of the API platform team.
Anyone else seeing this? And if any OpenAI staff want full curl traces (HAR-equivalent, sanitized) or the mp4 recording I already sent to support, happy to share — just ping me.
Thanks.