How to reduce latency when calling the OpenAI API?

logankilpatrick · January 20, 2023, 3:25pm

Hey folks, we just shipped an updated section of our “Production best practices” guide on Latency and how to reduce its impact on your application. S/o to @shyamal for pushing this forward:

michael10 · February 27, 2023, 11:08pm

Still slow AF. It takes about 13 seconds to return completions. It was about 7 seconds a few days ago, which is still really slow. What’s going on?

ruby_coder · February 28, 2023, 3:40am

It’s no secret “what is going on” @michael10

The number of OpenAI users has increased at the largest rate of any Internet technology in the history of the world.

The last I read, OpenAI recently went from 1 million users to 100 million in about a month’s time. I'm not sure what the user count is now, but OpenAI’s infrastructure is currently fragile and OpenAI is working to “beef it up” to meet the demand.

Hope this helps.
