My response times using the Assistants API is taking 20+ seconds for simple prompts like “test” or “hello.” Is there any way to speed this up? I’m trying to use it for an application and currently I don’t see how it would be useful with response times this slow.
1 Like
Migrate off of Assistant API and use your own solution with Chat Completions.
You will be able to significantly reduce costs too.
1 Like
You can review the various options outlined in the documentation:
https://platform.openai.com/docs/guides/latency-optimization
With assistants, optimization is more challenging, but there is room for improvement.
1 Like