Hey, I’m using GPT-3.5 Turbo with the Assistants API for my application.
I know it takes some time to process a request with a longer system prompt and message history, but when the user message is only 2 or 3 sentences it replies quickly, while a message of 5 to 7 sentences takes far too long, 6 to 7 minutes or more.
The issue here is latency. Should I shorten my system prompt? I don’t think that will help, because the response is already fast when the user messages are short.
The model you should use is gpt-3.5-turbo-1106; it is the only 3.5 variant that supports parallel tool calls and has the longer context length needed to track function calls.
Why the slowdown? I suspect you’ve also uploaded some documents, and the assistant has gone into a loop, trying to retrieve everything with mismanaged function calls. You can check the number of run steps for a run.
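A minimal sketch of checking the step count, assuming the openai Python SDK v1.x and placeholder thread/run IDs (yours will differ):

```python
def count_run_steps(client, thread_id: str, run_id: str) -> int:
    """Return how many steps (tool calls, message creations) a run took."""
    steps = client.beta.threads.runs.steps.list(thread_id=thread_id, run_id=run_id)
    return len(steps.data)

if __name__ == "__main__":
    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    # "thread_abc123" / "run_abc123" are placeholder IDs
    n = count_run_steps(client, "thread_abc123", "run_abc123")
    print(f"run executed {n} steps")
```

If you see dozens of steps for a single question, that is the retrieval/tool loop eating your time, not the model itself.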
Also, log into your account and check your rate tier under Limits. Tier 1 accounts can get slower models (really just slower token output), which makes any internal writing take much longer.
max_tokens is not a parameter you can set on Assistants, and you have no control over how long the internal writing of tool commands gets.
It sounds like you would be better off using the Chat Completions endpoint to talk to the AI.
The code is simpler, you can stream words to the user immediately, and you control how much of the old conversation is sent each time.
Here is a small example for a single user in Python. You will be able to see how fast the AI can produce language, and the conversation history is capped by a number of turns.
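A sketch of that Chat Completions approach, assuming the openai Python SDK v1.x; the model name and the `max_turns` value are illustrative choices, not requirements:

```python
SYSTEM = {"role": "system", "content": "You are a helpful assistant."}

def trim_history(history: list, max_turns: int = 5) -> list:
    """Keep only the last max_turns user/assistant pairs of the conversation."""
    return history[-(max_turns * 2):]

def chat(client, history: list, user_text: str, model: str = "gpt-3.5-turbo") -> str:
    """Send one user turn, streaming the reply token-by-token as it is written."""
    history.append({"role": "user", "content": user_text})
    # The system prompt is re-attached on every send; only recent turns follow it.
    messages = [SYSTEM] + trim_history(history)
    parts = []
    # stream=True yields chunks as the model writes, so words appear immediately
    stream = client.chat.completions.create(model=model, messages=messages, stream=True)
    for chunk in stream:
        delta = chunk.choices[0].delta.content
        if delta:
            print(delta, end="", flush=True)
            parts.append(delta)
    reply = "".join(parts)
    history.append({"role": "assistant", "content": reply})
    return reply

if __name__ == "__main__":
    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    history: list = []
    chat(client, history, "Hello!")
```

Because `trim_history` bounds what is sent, the prompt stops growing after `max_turns` exchanges, so latency stays flat no matter how long the user keeps chatting.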