Thanks, @linus for your responses and no problem about the delay. As you rightly guessed had started using OpenAI Embeddings with Vector Store and Langchain to build a solution. Works well overall. But randomly, for the same input text and question it ends up taking up to 7 minutes to generate a response.