Api speed issue (compare with web GPT-4 with the same question), gpt-4-1106-preview

RomanSh · January 25, 2024, 9:33am

Hello,
With the web GPT-4 version I can upload document and ask the same question (the same as by API with gpt-4-1106-preview), the document analyze takes up to 1-2 seconds, whole response takes up to 1 min (with my actions like open and upload file and copy question to thread). What should I do to reach the same speed with api? Why the api process takes up to 3-7 minutes?
The steps to reproduce the issue with api are:

Upload file
Create assistant with uploaded file id,
Create thread
Add Message and ask to analyze the docx file
Create run
Check run status
Get answer
I’ve tried to make shorter instruction for API and attache file to message instead of assistant, but got the same result.
The help center respond me with the common answer on my question without any arguments why so big difference. (looks like bot answer)
We use the service to analyze customers’ documents, it doesn’t fit us with this speed. Can you fix the issue?
__
Best regards

RomanSh · January 25, 2024, 11:23am

Will it be fixed in the GPT-4 turbo stable version?

Topic		Replies	Views
Assistant GPT 3.5 model API poor performance Bugs assistants-api	1	568	January 22, 2024
Why Assistants API is Slow? Any speed solution? API api-speed , openai , rag , assistants-api	15	7783	September 10, 2024
Assistants API Performance API api , assistants-api	11	2735	March 21, 2024
Gpt-3-turbo slow vs chatgpt 3 website API	5	1642	December 16, 2023
Slow response time with GPT-4 API	2	5067	March 19, 2023

Api speed issue (compare with web GPT-4 with the same question), gpt-4-1106-preview

Related topics