Embeddings API Response Slow

jeremy.wagner · January 29, 2024, 8:45pm

We are using the embeddings API to answer questions, and each response takes upwards of 20 seconds on average. Is there a way to speed this up?

supershaneski · January 29, 2024, 11:51pm

Which part of your Q&A flow is taking the time? Embedding the user query? Searching the stored vector data for an answer? The final chat completions API call?
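One way to find out is to time each stage separately. Below is a rough sketch, not anyone's actual setup: `embed_query`, `search_vectors`, and `complete_chat` are hypothetical stand-ins (they just sleep) that you would replace with your own embeddings call, vector store lookup, and chat completions call.

```python
import time
from contextlib import contextmanager


@contextmanager
def timed(stage, timings):
    """Record the wall-clock duration of the wrapped block under `stage`."""
    start = time.perf_counter()
    try:
        yield
    finally:
        timings[stage] = time.perf_counter() - start


# Hypothetical placeholders: swap these for your real calls.
def embed_query(text):
    time.sleep(0.01)  # stands in for the embeddings API request
    return [0.0, 1.0]


def search_vectors(vector):
    time.sleep(0.02)  # stands in for the vector store lookup
    return ["stored passage"]


def complete_chat(question, passages):
    time.sleep(0.03)  # stands in for the chat completions request
    return "answer"


def answer(question):
    """Run the three stages and return the reply plus per-stage timings."""
    timings = {}
    with timed("embed_query", timings):
        vector = embed_query(question)
    with timed("vector_search", timings):
        passages = search_vectors(vector)
    with timed("chat_completion", timings):
        reply = complete_chat(question, passages)
    return reply, timings


if __name__ == "__main__":
    reply, timings = answer("How do I reset my password?")
    # Print the slowest stage first.
    for stage, seconds in sorted(timings.items(), key=lambda kv: kv[1], reverse=True):
        print(f"{stage}: {seconds:.3f}s")
```

Whichever stage dominates the total is the one worth optimizing first.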

atlashedged · January 30, 2024, 8:07am

We have the same issue with voice response. Averaging 15-17 seconds for a response. No user will tolerate that. Would love to hear from others what they have figured out to speed this up.
