I am making POST requests to the https://api.openai.com/v1/chat/completions endpoint to extract a list of properties from free-flow text and return them as JSON. I am using 500-1,000 prompt tokens, 1,500-2,500 completion tokens, and ~2,000 reasoning tokens per request, which seem like relatively small numbers.
Each request takes between 15 and 30 seconds to return a response. Is this normal? I've tried all the gpt-5 variants (vanilla, mini, and nano), and the response times for all of them fall in the same 15-30 second range.
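Here's a simplified sketch of what each request looks like (the prompt, model name, and property schema are placeholders, not my real ones; `reasoning_effort: "minimal"` is something I'm experimenting with to cut down the ~2,000 reasoning tokens, on the assumption that the gpt-5 family accepts that parameter on this endpoint):

```python
import json
import time
import urllib.request

API_URL = "https://api.openai.com/v1/chat/completions"

def build_payload(text: str, model: str = "gpt-5-mini") -> dict:
    """Build the Chat Completions request body for property extraction."""
    return {
        "model": model,
        # Assumption: gpt-5 models accept reasoning_effort here;
        # "minimal" should reduce reasoning-token overhead.
        "reasoning_effort": "minimal",
        # Ask for a JSON object back instead of free text.
        "response_format": {"type": "json_object"},
        "messages": [
            {
                "role": "system",
                "content": "Return a JSON object listing the properties "
                           "mentioned in the user's text.",
            },
            {"role": "user", "content": text},
        ],
    }

def timed_request(api_key: str, text: str):
    """POST the payload and return (elapsed_seconds, parsed_response)."""
    req = urllib.request.Request(
        API_URL,
        data=json.dumps(build_payload(text)).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )
    start = time.monotonic()
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    return time.monotonic() - start, body
```

With this wrapper I log the elapsed time for every call, which is where the 15-30 second figures come from.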
For context, I have a new OpenAI account and am currently a "Tier 1" customer. I've read in other posts that Azure-hosted OpenAI has faster response times, but it's hard to believe that small requests to https://api.openai.com/v1/chat/completions with very little reasoning should consistently take this long. Is this expected?