Super slow Assistants API responses

Hello, every time I call the API it takes super long to respond, up to like 30 seconds.

Assistants should not be the primary way that you interact with AI models.

Chat Completions + streaming = get the output starting within seconds.

Assistants: multiple API calls to set a run in motion, and then more polling to find out when it's done.
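To make the contrast concrete, here's a minimal streaming sketch with the `openai` Python client. It assumes `OPENAI_API_KEY` is set in the environment and the `openai` package is installed; the model name is just an example. The `join_deltas` helper simply reassembles the streamed chunks into the full reply.

```python
def join_deltas(deltas):
    """Accumulate streamed text deltas into the full reply text."""
    return "".join(d for d in deltas if d)

def stream_reply(prompt: str, model: str = "gpt-3.5-turbo-0125") -> str:
    """Stream a Chat Completions response, printing tokens as they arrive."""
    from openai import OpenAI  # assumes the openai package is installed

    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    stream = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
        stream=True,  # tokens arrive as they are generated
    )
    deltas = []
    for chunk in stream:
        delta = chunk.choices[0].delta.content
        if delta:
            print(delta, end="", flush=True)  # first text within seconds
            deltas.append(delta)
    return join_deltas(deltas)
```

With `stream=True` you see the first words almost immediately, instead of waiting for an Assistants run to finish before fetching anything.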

What do I do instead? I need an AI trained on my data that can respond like an Assistant. Is there another solution?

“Assistants” is just a poor, confusing name for an agent framework that can make multiple calls internally. You aren’t “training an AI on data”; the data is either injected into context or made searchable with internal functions.

so adapt to:

Chat Completions + retrieval-augmented generation + streaming = get the output starting within seconds.

where RAG uses a semantic-search vector database to inject relevant results for the user input into the AI’s context before the main AI even generates a token.
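The retrieval step above can be sketched in plain Python. This is a toy example: `embed()` here is a bag-of-words stand-in so the code runs on its own, where a real pipeline would call an embeddings endpoint and store vectors in a vector database. The documents and the `build_prompt` wording are made up for illustration.

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Stand-in "embedding": a bag-of-words count vector.
    # In production you'd call an embeddings API instead.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query: str, docs: list[str], k: int = 2) -> list[str]:
    """Return the k documents most similar to the query."""
    q = embed(query)
    ranked = sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]

def build_prompt(query: str, docs: list[str]) -> str:
    """Inject retrieved context ahead of the user question."""
    context = "\n".join(retrieve(query, docs))
    return f"Use this context to answer:\n{context}\n\nQuestion: {query}"

docs = [
    "Refunds are processed within 5 business days.",
    "Our office is open Monday to Friday.",
    "Shipping is free on orders over $50.",
]
print(build_prompt("how long do refunds take", docs))
```

The prompt built this way is then sent through Chat Completions with streaming, so retrieval happens before the model generates its first token.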

You can also evaluate the token production rate of individual models. gpt-3.5-turbo-0125, for example, being new, was producing around 100 tokens per second.

Yes, but I like the thread management function.
I don’t want to write many functions to fetch chat history, summarize it, and manage database storage for the history file.
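For what it's worth, basic thread management is only a few lines. Here's a hedged sketch of a DIY replacement: keep the message list in a JSON file and trim old turns to stay under a budget. The file name and the crude length-based trimming are assumptions; a real app might summarize old turns with a cheap model instead of dropping them.

```python
import json
from pathlib import Path

class Thread:
    """Persist a chat history to a JSON file, trimming old turns."""

    def __init__(self, path: str = "thread.json", max_messages: int = 20):
        self.path = Path(path)
        self.max_messages = max_messages
        # Reload prior turns if the thread file already exists.
        self.messages = (
            json.loads(self.path.read_text()) if self.path.exists() else []
        )

    def add(self, role: str, content: str) -> None:
        self.messages.append({"role": role, "content": content})
        # Keep only the most recent turns (crude stand-in for summarizing).
        self.messages = self.messages[-self.max_messages:]
        self.path.write_text(json.dumps(self.messages, indent=2))

    def for_api(self, system: str) -> list[dict]:
        # The list you'd pass as `messages` to Chat Completions.
        return [{"role": "system", "content": system}] + self.messages
```

Each user turn becomes `thread.add("user", text)`, the reply becomes `thread.add("assistant", reply)`, and `thread.for_api(...)` produces the messages parameter for the next call.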

Is there a video on YouTube about this? I would love to explore.

Also, I heard that if I move to a higher tier, the assistants reply faster.

Yes, OpenAI had throttled the token generation rate of some models for accounts in “tier 1”. I haven’t heard much forum complaint about this recently (after they hit a whole bunch of API users with slower output without announcement), so I can’t say what improvement you would see by prepaying more to reach a higher payment trust tier.

Here’s a link for courses: “semantic search” and “vector database” are the terms you’re after, including some courses sponsored by OpenAI.

You could explore LangChain: https://python.langchain.com/docs/get_started/introduction. It also gives you greater control over which chat model you use, and the process of setting up a knowledge base (from which the AI agent can retrieve context) is quite simple. Plus, you won’t have to pay anything extra to index your documents, unlike the retrieval tool in OpenAI’s Assistants API. You can also use free cloud-based vector stores to host your data index online.