Text-davinci-003 response time slowing beyond 30-45 seconds for completion

andarmmanik · June 22, 2023, 4:34pm

The system we’re designing creates many small text-davinci calls. The overall speed of the system is limited by this response time. Would it be more optimal to batch these into a single query or will these response times still remain high?

albina · June 22, 2023, 4:43pm

As far as I know, batching multiple small text-based queries into a single query helps minimize overhead and allows for parallel processing. However, essay when you manage the batch size, consider the trade-off between reduced overhead and increased latency.

Topic		Replies	Views
Too long response time on API gpt-3.5-turbo model API	3	1580	December 25, 2023
Parallel API Requests - Very Long Response Times API	4	747	July 26, 2024
API calls to code-davinci-002 so slow API codex	1	658	March 11, 2023
Issues with Rate Limiting and Batch Processing in OpenAI API Community api , batching	0	1671	November 11, 2023
Open AI Reponse is Slow API	3	7106	December 25, 2023

Text-davinci-003 response time slowing beyond 30-45 seconds for completion

Related Topics