Is the Assistants API really suited to building efficient chatbots?

Has anyone created a chatbot with this? Because it is really VERY slow. Even for a simple message with an Assistant without any configuration, testing from the playground, it can take up to 10 s to reply. I’ve read in some forum posts that many people recommend using Completions instead, and I think that’s what I’m going to go for. Or is there any way to improve the response time with the Assistants API?


Hi, and welcome to the community!

The Assistants API has never been the fastest, but many prefer it for its ease of use in managing conversation history, file uploads, a knowledge base, and tools like the code interpreter.

That’s the main advantage of using the Assistants API—it enables quick and easy prototyping. If your use case isn’t time-sensitive, it may be a sufficiently good option.
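To see why each turn is slow, it helps to look at the shape of an Assistants API call. The sketch below assumes the openai Python SDK v1.x with its beta endpoints and is not run here; the networked steps are shown for structure only, and `ask_assistant` / `latest_assistant_text` are hypothetical helper names. Each numbered step is a separate HTTP round-trip, which is one reason latency accumulates.

```python
import time


def ask_assistant(client, assistant_id: str, question: str):
    """One question/answer turn against the Assistants API (sketch only,
    assumes the openai Python SDK v1.x beta endpoints; not executed here)."""
    thread = client.beta.threads.create()                       # 1. new thread
    client.beta.threads.messages.create(                        # 2. add message
        thread_id=thread.id, role="user", content=question)
    run = client.beta.threads.runs.create(                      # 3. start run
        thread_id=thread.id, assistant_id=assistant_id)
    while run.status in ("queued", "in_progress"):              # 4. poll
        time.sleep(0.5)
        run = client.beta.threads.runs.retrieve(
            thread_id=thread.id, run_id=run.id)
    return client.beta.threads.messages.list(thread_id=thread.id)


def latest_assistant_text(messages) -> str:
    """Pick out the newest assistant reply from a newest-first list of
    message dicts (a simplified plain-dict shape, for illustration)."""
    for m in messages:
        if m["role"] == "assistant":
            return m["content"]
    return ""
```

The upside is that the thread object stores the conversation for you; the downside is the create/poll round-trips shown above happen on every turn.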

On the other hand, using the Completions API requires you to build all the necessary functionality yourself. As you’ve already discovered, it typically has lower latency and offers more flexibility to optimize overall system performance.
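"Building the functionality yourself" mostly means managing conversation state. A minimal sketch, assuming the openai Python SDK and an assumed model name; `trim_history` and `chat` are hypothetical helpers, and the API call is shown for shape only, not executed here:

```python
def trim_history(messages, max_turns=10):
    """Keep the system message plus only the most recent turns, so the
    prompt (and cost/latency) does not grow without bound."""
    system = [m for m in messages if m["role"] == "system"]
    rest = [m for m in messages if m["role"] != "system"]
    return system + rest[-max_turns:]


def chat(client, messages, user_text):
    """One chatbot turn against the Chat Completions API (sketch only)."""
    messages.append({"role": "user", "content": user_text})
    resp = client.chat.completions.create(
        model="gpt-4o-mini",              # assumed model name
        messages=trim_history(messages),
    )
    reply = resp.choices[0].message.content
    messages.append({"role": "assistant", "content": reply})
    return reply
```

Because you own the history list, you can trim, summarise, or cache it however you like, which is exactly the flexibility the Assistants API trades away.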

I also want to mention that OpenAI is still actively developing the Assistants API. Personally, I recommend using the Completions API, but that’s just my opinion.


Vb has already answered your question.

The Assistants API is in beta and primarily designed for retrieval-augmented generation (RAG), which involves multiple steps such as query optimisation, retrieval, reranking, and response generation, plus tool/function calls, leading to an average latency of 5-10 seconds.
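The stages above can be sketched as a toy, self-contained pipeline. This is a deliberately simplified stand-in (keyword overlap instead of embeddings, and the generation step stubbed out); all function names are hypothetical. The point is that each stage is a separate step, and in a real system most of them are network or model calls, so their latencies add up.

```python
def optimise_query(query: str) -> list[str]:
    # Stage 1: "query optimisation" reduced here to keyword extraction.
    stopwords = {"the", "a", "an", "is", "how", "do", "i"}
    return [w for w in query.lower().split() if w not in stopwords]


def retrieve(keywords, docs, k=3):
    # Stage 2: retrieval by keyword-overlap score (stand-in for vector search).
    scored = [(sum(w in d.lower() for w in keywords), d) for d in docs]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [d for score, d in scored[:k] if score > 0]


def rerank(keywords, candidates):
    # Stage 3: rerank the candidates (trivially, by the same score again;
    # a real system would call a cross-encoder or reranking model here).
    return sorted(candidates, key=lambda d: -sum(w in d.lower() for w in keywords))


def answer(query, docs):
    # Stage 4: generation is stubbed; a real pipeline would call the LLM
    # here, adding one more round-trip on top of the stages above.
    keywords = optimise_query(query)
    context = rerank(keywords, retrieve(keywords, docs))
    return context[0] if context else "no match"
```

Swap any stage for a model call and you can see where the 5-10 seconds come from.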

If speed is the priority, use the Chat Completions API with a fast model instead of an Assistant: limit intermediate steps, enable streaming, keep the persona prompt concise, and cap the output tokens.
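Streaming does not make generation faster, but it makes the wait feel shorter, and the metric worth watching is time to first token. A small sketch of a timing wrapper (a hypothetical helper, works over any iterator) and, in a comment, the kind of streaming call it would wrap, assuming the openai Python SDK:

```python
import time


def time_to_first_chunk(stream):
    """Consume any iterator of chunks, returning (seconds until the first
    chunk arrived, list of all chunks). Returns (None, []) if the stream
    was empty. Useful for measuring perceived streaming latency."""
    start = time.monotonic()
    first = None
    chunks = []
    for chunk in stream:
        if first is None:
            first = time.monotonic() - start
        chunks.append(chunk)
    return first, chunks


# Example of the stream this would wrap (not executed here):
# stream = client.chat.completions.create(
#     model="gpt-4o-mini",          # assumed model name
#     messages=[{"role": "user", "content": "hi"}],
#     stream=True,
#     max_tokens=256,               # capping output also bounds total time
# )
# ttft, chunks = time_to_first_chunk(stream)
```

If time to first token is low, streaming will keep the chat feeling responsive even when the full reply takes several seconds.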

However, the Assistants API simplifies performing RAG and tools/functions, making it useful for quick prototyping.

If your use case isn’t time-sensitive, it can still be a decent option.


Agreed. I’m timing requests, and it takes 2-4 seconds just to queue up the run. Unacceptable. Streaming at least makes it nearly bearable, but any chatbot built on it crawls along until the user quits out of boredom.

This topic was automatically closed 2 days after the last reply. New replies are no longer allowed.