Hi everyone,
I first created an Assistant on the OpenAI platform and integrated it into my application. Since responses were extremely slow when calling the API directly, I built a Python project (FastAPI), hosted on my OVH server, to try to improve performance.
Here’s what I did:
- Created an Assistant on the OpenAI platform.
- Connected my app to it using the OpenAI API.
- Hosted the Python project on my server to handle the requests.
However, the latency is still very high:
- On my server, responses take about 20 seconds.
- If I call the API directly, it can take around 40 seconds.
I also tried a second approach (optimizing the polling loop, reducing the delays between status checks, etc.), but the performance is still far too slow for real-time use.
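For context, the polling loop I've been tuning looks roughly like this (simplified sketch; the backoff numbers are just what I settled on, and `client` is an `openai.OpenAI` instance):

```python
import time


def poll_delays(initial=0.25, factor=1.5, cap=2.0):
    """Yield capped exponential-backoff delays between status checks."""
    delay = initial
    while True:
        yield min(delay, cap)
        delay *= factor


def wait_for_run(client, thread_id, run_id, timeout=60.0):
    """Poll an Assistants run until it leaves the queued/in_progress states."""
    deadline = time.monotonic() + timeout
    for delay in poll_delays():
        run = client.beta.threads.runs.retrieve(thread_id=thread_id, run_id=run_id)
        if run.status not in ("queued", "in_progress"):
            return run  # completed, failed, requires_action, etc.
        if time.monotonic() >= deadline:
            raise TimeoutError(f"run {run_id} still {run.status} after {timeout}s")
        time.sleep(delay)
```

Even with short initial delays like these, the total wall-clock time is dominated by how long the run itself stays in `in_progress`, so tightening the loop barely helped.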
This makes the Assistants API very difficult to use in production apps where users expect answers within a few seconds.
Has anyone else experienced this? Is such latency normal for the Assistants API, or is there a recommended way to make responses faster?
Thanks in advance for your insights!