Assistant GPT 3.5 model API poor performance

nikritikos · January 22, 2024, 2:19pm

Hello,

We are using an assistant backed by the GPT-3.5 Turbo 16k model, and we create threads, messages, and runs through the API. The performance is poor, which makes the application unpresentable and therefore useless.
We understand that GPT-4 Turbo and the other latest models are much slower, but shouldn't the most capable of the earlier models (in terms of reasoning) also have decent response times?
Or is this because the Assistants API is still in beta?

The average response time is 10-15 seconds, and sometimes even longer.

We would appreciate an official answer, as this is a business requirement and not just an experiment.

Thank you.

Foxalabs · January 22, 2024, 2:48pm

Hi and welcome to the Developer Forum!

The Assistants API is still in development and is not suitable for a production environment. You should always make sure your clients and customers are aware that AI products are cutting-edge and rapidly evolving, and of the issues that come with that.

All API-based services will encounter outages and communication issues. AI is especially prone to them, because so much of what is being offered is totally new and there are no previous systems to look back on for methodologies.
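
For anyone who wants to see where the 10-15 seconds actually go, here is a minimal sketch that times each step (thread creation, message creation, run creation, waiting for the run) separately. It assumes the `openai` Python package (v1.x) with its `client.beta.threads` namespace and an API key in the environment. The helper names `timed`, `wait_for_run`, and `measure_assistant_latency` are made up for illustration, and the 0.5-second polling interval is an arbitrary choice.

```python
import time
from typing import Callable, Dict

# Run statuses at which polling should stop (per the Assistants API run lifecycle).
STOP_STATUSES = {"completed", "failed", "cancelled", "expired", "requires_action"}


def timed(label: str, fn: Callable[[], object], timings: Dict[str, float],
          clock: Callable[[], float] = time.perf_counter):
    """Call fn(), record its wall-clock duration under `label`, return its result."""
    start = clock()
    result = fn()
    timings[label] = clock() - start
    return result


def wait_for_run(get_status: Callable[[], str], poll_interval: float = 0.5,
                 timeout: float = 60.0,
                 clock: Callable[[], float] = time.perf_counter,
                 sleep: Callable[[float], None] = time.sleep) -> str:
    """Poll get_status() until the run stops or the timeout elapses."""
    deadline = clock() + timeout
    while True:
        status = get_status()
        if status in STOP_STATUSES:
            return status
        if clock() >= deadline:
            raise TimeoutError(f"run still '{status}' after {timeout}s")
        sleep(poll_interval)


def measure_assistant_latency(assistant_id: str, prompt: str) -> Dict[str, float]:
    """Time each step of one request. Needs the `openai` package and an API key."""
    from openai import OpenAI  # imported lazily so the helpers above work without it

    client = OpenAI()
    timings: Dict[str, float] = {}
    thread = timed("create_thread", lambda: client.beta.threads.create(), timings)
    timed("create_message", lambda: client.beta.threads.messages.create(
        thread_id=thread.id, role="user", content=prompt), timings)
    run = timed("create_run", lambda: client.beta.threads.runs.create(
        thread_id=thread.id, assistant_id=assistant_id), timings)
    timed("wait_for_run", lambda: wait_for_run(
        lambda: client.beta.threads.runs.retrieve(
            thread_id=thread.id, run_id=run.id).status), timings)
    return timings
```

Calling `measure_assistant_latency("asst_...", "Hello")` with your own assistant ID returns a dict of seconds per step. Note that polling adds up to one `poll_interval` of delay to the `wait_for_run` figure, so a shorter interval gives a more accurate reading at the cost of more requests.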
