I run a lot of batch API calls with asyncio.gather(), similar to the example below, and I use asynciolimiter.StrictLimiter to limit the rate of the calls (a sketch of how I wire it in is at the end of this post). However, I find that some of the calls just hang and take forever to complete. Is there a reason for this? Am I hitting some API limit? How can I prevent it? I also set max_tokens so the output doesn't get too long. Any insight would be appreciated.
Thanks!
import asyncio

from openai import AsyncOpenAI

aclient = AsyncOpenAI()


async def process_prompt(prompt):
    # One chat completion per prompt; gather() below runs these concurrently.
    response = await aclient.chat.completions.create(
        model="gpt-4",
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content


async def main():
    prompts = ["Explain quantum entanglement", "Summarize WW2", "Define monad"]
    tasks = [process_prompt(prompt) for prompt in prompts]
    results = await asyncio.gather(*tasks)
    print(results)


asyncio.run(main())
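For completeness, this is roughly how the StrictLimiter and max_tokens I mentioned are wired in. The rate of 2 requests per second and the max_tokens value of 500 are just illustrative numbers, not what I actually run:

import asyncio

from asynciolimiter import StrictLimiter
from openai import AsyncOpenAI

aclient = AsyncOpenAI()
limiter = StrictLimiter(2)  # illustrative rate: at most 2 requests per second


async def process_prompt(prompt):
    # Wait for the limiter before each request so the whole batch stays under the rate.
    await limiter.wait()
    response = await aclient.chat.completions.create(
        model="gpt-4",
        messages=[{"role": "user", "content": prompt}],
        max_tokens=500,  # illustrative cap to keep completions short
    )
    return response.choices[0].message.content

Even with the limiter pacing the requests, I still see the occasional call hang, which is what I'm trying to track down.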