Asynchronous version of the API?

If I am calling the OpenAI API from AWS Lambda Node.js code (or really any server code), that code has to wait until it gets the reply from the OpenAI API. The response can take tens of seconds, and during this time the server code must keep running, so I am paying for the execution time even if I await the API call asynchronously. The Lambda code must stay alive and hold the connection to the OpenAI API.
Currently I am calling the API from PHP. There the script has to run the whole time and wait for the API call to complete, so I spend quite a lot of server resources because the wait is long. Is there a way to work around this? If OpenAI had an asynchronous API where you could send a request and then poll for the response, that would save a lot of server time. Is there a solution like this?
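For what it's worth, you can build a submit-then-poll pattern yourself on top of the synchronous API: one endpoint accepts the request and returns a job ID immediately, the slow OpenAI call finishes in the background, and the client polls a second endpoint for the result. Here is a minimal Node.js sketch, assuming a single long-lived process and an in-memory job store (in Lambda you'd use something durable like DynamoDB or SQS instead); `callOpenAI()` is a hypothetical stand-in for your real API call:

```js
const http = require("http");
const crypto = require("crypto");

const jobs = new Map(); // jobId -> { status, result }

// Hypothetical stand-in for the real OpenAI call; swap in your SDK/HTTP request.
async function callOpenAI(prompt) {
  return `echo: ${prompt}`;
}

http.createServer((req, res) => {
  const url = new URL(req.url, "http://localhost");

  if (req.method === "POST" && url.pathname === "/submit") {
    let body = "";
    req.on("data", (chunk) => (body += chunk));
    req.on("end", () => {
      const jobId = crypto.randomUUID();
      jobs.set(jobId, { status: "pending", result: null });
      // Fire and forget: respond immediately while the slow call
      // completes in the background.
      callOpenAI(body)
        .then((result) => jobs.set(jobId, { status: "done", result }))
        .catch(() => jobs.set(jobId, { status: "error", result: null }));
      res.end(JSON.stringify({ jobId }));
    });
  } else if (req.method === "GET" && url.pathname === "/poll") {
    const job = jobs.get(url.searchParams.get("jobId"));
    res.end(JSON.stringify(job ?? { status: "unknown" }));
  } else {
    res.writeHead(404).end();
  }
}).listen(3000);
```

The catch, as noted below, is that a fire-and-forget background task doesn't survive in Lambda once the handler returns, which is why this pattern really wants a persistent process or a queue in between.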


Perhaps lambdas/functions might not be the best choice for you.

I don’t remember the exact numbers, but IIRC Node can handle tens (if not hundreds) of thousands of parallel connections in a single instance. Spawning a new container instance for every request is obviously a huge waste.
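To make that concrete, here's a toy demonstration (not OpenAI-specific) of why a single persistent Node process wastes so little while waiting: the awaits overlap, so a thousand 10-second calls finish in about 10 seconds of wall time, not 10,000:

```js
// Each call stands in for a slow OpenAI request (hypothetical 10 s latency).
function slowCall(i) {
  return new Promise((resolve) => setTimeout(() => resolve(i), 10_000));
}

async function main() {
  const started = Date.now();
  // 1,000 "requests" in flight at once in a single process.
  const results = await Promise.all(
    Array.from({ length: 1000 }, (_, i) => slowCall(i))
  );
  // Prints ~10,000 ms, not 10,000,000 ms: the waits overlap.
  console.log(results.length, "calls finished in", Date.now() - started, "ms");
}

main();
```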

If it’s technically too involved for you, or you don’t want to deal with it, maybe consider using one of those bot-building platforms.

edit: Cloud Foundry used to be really good for stuff like this, but it seems to be slowly becoming less and less accessible.

The API is 100% synchronous right now.

To get something back instantly, look into streaming through the API.
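Here's a minimal streaming sketch, assuming Node 18+ (for the global `fetch`) and the chat completions endpoint; the model name is just for illustration, and the line parsing is naive — production code should buffer partial SSE lines that get split across chunks:

```js
async function main() {
  const response = await fetch("https://api.openai.com/v1/chat/completions", {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${process.env.OPENAI_API_KEY}`,
    },
    body: JSON.stringify({
      model: "gpt-3.5-turbo",
      messages: [{ role: "user", content: "Hello!" }],
      stream: true, // tokens arrive incrementally as server-sent events
    }),
  });

  const decoder = new TextDecoder();
  for await (const chunk of response.body) {
    // Naive parsing: assumes each "data: {...}" SSE line arrives whole.
    for (const line of decoder.decode(chunk).split("\n")) {
      if (!line.startsWith("data: ") || line.includes("[DONE]")) continue;
      const delta = JSON.parse(line.slice(6)).choices[0].delta;
      if (delta.content) process.stdout.write(delta.content);
    }
  }
}

main();
```

The total wait is the same, but the first tokens show up within a second or two, so the user (or your relay server) isn't staring at a blank response for tens of seconds.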