How to improve OpenAI Assistants API (File Retrieval) with response time?

So I’m using the Assistants API with File Retrieval and it is just taking too much time. In most cases, the average response time is around 50 seconds and for absolute basic queries it still takes 10 seconds. In the worst case scenario it even takes 3 minutes!
So I’m gonna need some help on this because I want this to be production-ready as my entire chatbot is based around it.
You can even suggest a better RAG alternative to the Assistants API which can give me faster responses while maintaining similar functionality.

Hi @fateh.ali - based on communication from OpenAI staff, it looks like an Assistant’s update is around the corner :slight_smile:

Hi,
Any improvement here? its still taking 15 to 20 seconds on avg with a small document ? is there any way to speed up ?