How can I make my assistant responses faster?

My response times using the Assistants API is taking 20+ seconds for simple prompts like “test” or “hello.” Is there any way to speed this up? I’m trying to use it for an application and currently I don’t see how it would be useful with response times this slow.

1 Like

Migrate off of Assistant API and use your own solution with Chat Completions.

You will be able to significantly reduce costs too.

1 Like

You can review the various options outlined in the documentation:

https://platform.openai.com/docs/guides/latency-optimization

With assistants, optimization is more challenging, but there is room for improvement.

1 Like