I want to give my users the feeling that the assistant works with “zero latency”: the user should receive a friendly response from the assistant immediately after asking a question, before any tools are used. Some of the tools are functions that can take a very long time to return, and I don’t want to keep the user waiting that long for the first response.
BTW, I’m “streaming” by fetching the thread’s run steps in a loop and extracting all of the messages whose content is of type “text”.
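Roughly what my loop looks like, with the immediate acknowledgement I’m after sent up front. This is a minimal TypeScript sketch assuming the OpenAI Node SDK’s beta Assistants endpoints (names as of the v4 SDK); `sendToUser` is a hypothetical stand-in for however my UI receives text:

```ts
import OpenAI from "openai";

const client = new OpenAI();

// Hypothetical: however the app pushes text to the end user.
declare function sendToUser(text: string): void;

async function askWithInstantAck(threadId: string, assistantId: string, question: string) {
  // 1. Acknowledge immediately, before the run (and any slow tools) even starts.
  sendToUser("Working on it…");

  await client.beta.threads.messages.create(threadId, { role: "user", content: question });
  const run = await client.beta.threads.runs.create(threadId, { assistant_id: assistantId });

  const seen = new Set<string>(); // message IDs already forwarded to the user
  let status = run.status;

  // 2. Poll the run steps; forward the text of each completed message step.
  while (status === "queued" || status === "in_progress" || status === "requires_action") {
    await new Promise((r) => setTimeout(r, 1000));

    const steps = await client.beta.threads.runs.steps.list(threadId, run.id);
    for (const step of steps.data) {
      if (step.status === "completed" && step.step_details.type === "message_creation") {
        const msgId = step.step_details.message_creation.message_id;
        if (seen.has(msgId)) continue;
        seen.add(msgId);

        const msg = await client.beta.threads.messages.retrieve(threadId, msgId);
        for (const part of msg.content) {
          if (part.type === "text") sendToUser(part.text.value);
        }
      }
    }
    status = (await client.beta.threads.runs.retrieve(threadId, run.id)).status;
  }
}
```

The acknowledgement goes out before the run is created, so the user sees something instantly no matter how long the tool calls take.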
Just use JavaScript to throw in a “working on this” type of response, then replace it with the real response when it arrives. At this point, though, the Assistants API is so slow and inconsistent that you probably shouldn’t be using it in any production scenario.
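Something like this, sketched in TypeScript against the browser DOM; `fetchAssistantReply` is a hypothetical stand-in for whatever call hits your backend:

```ts
// Hypothetical: wraps the round trip to your server / the Assistants API.
declare function fetchAssistantReply(question: string): Promise<string>;

async function ask(question: string) {
  // Show a placeholder bubble instantly.
  const bubble = document.createElement("div");
  bubble.className = "assistant-message";
  bubble.textContent = "Working on this…";
  document.getElementById("chat")!.appendChild(bubble);

  try {
    // Swap the placeholder for the real answer whenever it arrives.
    bubble.textContent = await fetchAssistantReply(question);
  } catch {
    bubble.textContent = "Sorry, something went wrong. Please try again.";
  }
}
```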
Do you know of a better wrapper that could be used like an assistant and is more production-ready?
If you are not dependent on OpenAI for the whole system, you can run LangChain as an interlocutor using an LLM like LLaMA. You can train LLaMA to route challenging tasks to OpenAI.
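Loosely, the routing could look like this. A TypeScript sketch where both helpers are hypothetical stand-ins, one for a self-hosted LLaMA endpoint and one for the OpenAI API:

```ts
// Hypothetical helpers: wire these to your local LLaMA server and to OpenAI.
declare function askLocalLlama(prompt: string): Promise<string>;
declare function askOpenAI(prompt: string): Promise<string>;

async function route(question: string): Promise<string> {
  // The local model acts as the interlocutor: it answers directly, or emits
  // a sentinel (which it has been tuned to produce) when the task is too hard.
  const verdict = await askLocalLlama(
    `Answer the user, or reply exactly "ESCALATE" if the task is too hard:\n${question}`
  );
  return verdict.trim() === "ESCALATE" ? askOpenAI(question) : verdict;
}
```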
Agree with the other comment about maybe using another platform. For me, I use chat completions since I don’t need Code Interpreter, and for anything file-related I can just vectorize the files and match against them. I think the Assistants API has lots of potential in theory, but it just isn’t up to snuff right now. We’ll see how things look in the new year, as this tech is moving quite fast.
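For what it’s worth, the “vectorize and match” part is just embeddings plus cosine similarity. A minimal TypeScript sketch assuming the OpenAI Node SDK, with chunk storage simplified to an in-memory array:

```ts
import OpenAI from "openai";

const client = new OpenAI();

async function embed(text: string): Promise<number[]> {
  const res = await client.embeddings.create({
    model: "text-embedding-ada-002",
    input: text,
  });
  return res.data[0].embedding;
}

// Plain cosine similarity between two equal-length vectors.
function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

// Pick the most relevant pre-embedded chunk and answer via chat completions.
async function answer(question: string, chunks: { text: string; vec: number[] }[]) {
  const qVec = await embed(question);
  const best = chunks.reduce((a, b) => (cosine(qVec, a.vec) >= cosine(qVec, b.vec) ? a : b));

  const res = await client.chat.completions.create({
    model: "gpt-3.5-turbo",
    messages: [
      { role: "system", content: `Answer using this context:\n${best.text}` },
      { role: "user", content: question },
    ],
  });
  return res.choices[0].message.content ?? "";
}
```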