Responses.create() hangs indefinitely with large payloads on GPT-5 models (works with the same inputs on O3 and with small inputs on GPT-5)

When calling the Responses API (client.responses.create()) with GPT-5 models and large input payloads (~35k tokens), the API call never returns. The OpenAI platform shows the request was processed successfully (it appears in the logs/dashboard and the output was generated correctly), but the Python SDK hangs indefinitely waiting for the response. The same code works perfectly with O3 models or with smaller inputs on GPT-5.
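For reference, a minimal sketch of the call that hangs, with an explicit client timeout added as a workaround so the SDK raises instead of blocking forever (the model name, input, and timeout value here are placeholders, not the exact production code):

```python
from openai import OpenAI, APITimeoutError

# Client-level timeout (seconds) so the SDK raises instead of hanging
# indefinitely if the HTTP response never arrives.
client = OpenAI(timeout=120.0)

large_input = "..."  # stand-in for the ~35k-token payload assembled elsewhere

try:
    response = client.responses.create(
        model="gpt-5",   # hangs; "o3" with the same input returns normally
        input=large_input,
    )
    print(response.output_text)
except APITimeoutError:
    # The dashboard still shows the request as completed, but the SDK never
    # receives the response body.
    print("Request timed out client-side")
```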


I'm seeing the same issue with the Responses API; it started yesterday. In background mode, large inputs to GPT-5 / GPT-5 mini / GPT-4.1 seem to stay queued indefinitely and never transition to in-progress.

GPT-4.1 mini is slightly better, and the job does transition to in-progress, but what used to take under a minute now takes more than five minutes.
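For context, this is roughly the background-mode submit-and-poll loop in use; the model name, input, and polling interval are placeholders rather than the exact code:

```python
import time

from openai import OpenAI

client = OpenAI()

large_input = "..."  # stand-in for the large prompt

# Submit in background mode; the call returns immediately with status "queued".
job = client.responses.create(
    model="gpt-5",   # same behavior observed with gpt-5-mini / gpt-4.1
    input=large_input,
    background=True,
)

# Poll until the job finishes; for these large inputs the status currently
# stays "queued" and never moves to "in_progress".
while job.status in ("queued", "in_progress"):
    time.sleep(5)
    job = client.responses.retrieve(job.id)

print(job.status)
if job.status == "completed":
    print(job.output_text)
```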

Are there any performance issues on OpenAI's side?