Assistant API performance

Assistants is AMAZING!! But currently it takes around 30s to get a response from the assistant when it includes one or two tool calls (including planning and a confirmation message after).

Any news on when that will be improved? Does having an OpenAI enterprise account help with that?

We’re using Assistants API v2 with gpt-4-turbo (gpt-4o had lower accuracy and didn’t improve response time that much).


I gave up on the Assistants API due to performance. I now use it only for thread management. Recent updates allow adding messages with the assistant role, so I basically call the Chat Completions API for function calling. Once I receive the tool call and execute the function, the result is passed back to the Completions API with streaming, and the streamed response is saved back to the Assistants thread as an assistant-role message.
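A minimal sketch of that hybrid flow, assuming the OpenAI Python SDK. The actual network calls are shown only as comments; the helpers below just accumulate the stream and build the payload that gets written back onto the thread, so the names (`collect_stream`, `as_thread_message`) are illustrative, not from the post.

```python
# Hybrid approach: Chat Completions handles function calling + streaming,
# then the final text is saved to the Assistants thread as an assistant
# message. Helpers only assemble data; real API calls are in comments.
from typing import Iterable


def collect_stream(chunks: Iterable[str]) -> str:
    """Accumulate streamed text deltas into the final reply."""
    return "".join(chunks)


def as_thread_message(text: str) -> dict:
    """Payload for saving the streamed reply back onto the thread.
    With Assistants v2 this would be passed to:
      client.beta.threads.messages.create(thread_id=..., **payload)
    """
    return {"role": "assistant", "content": text}


# In real use the chunks come from:
#   stream = client.chat.completions.create(model="gpt-4-turbo",
#       messages=..., tools=..., stream=True)
# reading each chunk.choices[0].delta.content as it arrives.
reply = collect_stream(["The order ", "has shipped."])
payload = as_thread_message(reply)
```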

We were thinking about a somewhat similar approach. In your experience, have you noticed any difference in function calling, and especially in handling errors and fixing them based on response feedback? We heavily rely on providing feedback to tool calls to make the model correct its response and can’t go without that.

@amoradian - In function calling you can handle errors by returning the error message back to the LLM when you submit tool outputs, and poll if required. It understands the context and re-prompts the user with feedback. Once the user enters a permitted value, it calls the tool again using context from the previous conversation. If you are using Azure OpenAI, the content filter setting can cause slow outputs; members of the forum reported lightning-fast responses once they had dealt with it. Hope this helps, cheers!

Notes: Good prompt engineering, with proper descriptions of the tools and appropriate temperature and top_p values, should do it. You could use a top_p value close to 0 to sample only the highest-probability tokens with respect to context.
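To make that concrete, a small sketch of the sampling settings (the specific values are assumptions, not from the post; the parameter names match the Chat Completions API, which also suggests altering temperature or top_p but generally not both):

```python
# Conservative sampling settings for more deterministic tool calls.
def completion_params(model: str = "gpt-4-turbo") -> dict:
    return {
        "model": model,
        "top_p": 0.1,  # sample only from the top of the distribution
    }


params = completion_params()
# These kwargs would be unpacked into
#   client.chat.completions.create(**params, messages=..., tools=...)
```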