Davinci-text-003 Response Times

sahin.senol3 · November 29, 2022, 1:20pm

Today i made couple of test using davinci-text-002 and davinci-text-003.

I am using Google App Script to send the requests to openai. So basically i am getting prompts from different cells and sending API request one by one…

Here is the results:

davinci-text-002:

1st prompt: 3 sec
2nd prompt: 2 sec
3rd prompt: 2 sec
4th prompt: 4 sec
5th prompt: 2 sec
6th prompt: 7 sec !
7th prompt: 1 sec
8th prompt: 3 sec

Total: 24 sec.

davinci-text-003:

1st prompt: 4 sec
2nd prompt: 4 sec
3rd prompt: 3 sec
4th prompt: 9 sec !!
5th prompt: 3 sec
6th prompt: 12 sec !!!
7th prompt: 4 sec
8th prompt: 15 sec !!!

Total: 54 sec.

Seems like there is huge difference in terms of response time.

Does anyone know what might be the reason? I mean, i am sending exact same prompts but 002 responding me in total of 24 seconds whereas 003 response time is 54 seconds. More than double. Any thoughts?

nl.lorenz · November 29, 2022, 7:53pm

Yes I have made a similar experience. The response times are signifcantly longer.

PaulBellow · November 29, 2022, 11:34pm

While I don’t work for OpenAI, and I don’t have inside details, I would guess the higher quality output (including longer) is responsible for the increased time. I’m sure it’s a balancing act…

kitson.broadhurst · November 30, 2022, 3:54pm

Getting longer response times too for davinci on 003 over 002. It’s causing timeout limits on my serverless functions so while it’s producing better content it’s maybe not so useful because it’s taking so long to respond.
It would be good to get some update from the OpenAI team, I know it’s new so may be teething problems?

smith1302 · November 30, 2022, 5:52pm

Davinci-003 provides a longer output on average as well. Did you factor this into your response times? Longer outputs generally take longer, so that could be part of it, in addition to higher quality outputs.

duerr.simon · December 4, 2022, 3:24pm

I’m having similar experiences with 003 when it comes to response times. Output is consistently better, but at times takes quite long to load. Then, other times, it’s as fast as 002. The length of the output does not really seem to be a factor here, sometimes it takes a few seconds even for a simply reply.

Is this known and intended? Or something that can and will be improved upon?

jonathan.sabbah · December 4, 2022, 5:25pm

Same problem here, if that’s a bug from openai a quick fix would be appreciated

duerr.simon · December 4, 2022, 6:04pm

Just a few minutes ago I tried again, and this time 003 responded much faster! Similar to 002. Maybe something was fixed already? At least I know it can be fast now

// brief update: today I experienced some longer response times again. It seems to fluctuate (… based on load on their api? )

jonathan.sabbah · December 5, 2022, 8:02pm

Maybe the large amount of trafic coming for ChatGPT (I read that they get to 1m users!) is taking a part of the processing power that is normally dedicated for instructGPT

duerr.simon · December 6, 2022, 8:16am

Seems like that’s what’s happening. Just interesting that davinci-002 is not affected; apparently 003 and chatGPT share some infrastructure in that case.

atulsingh · December 24, 2022, 2:05pm

My Article writer tool started showing Timeout error since last 2-3 days. Earlier it was generating awesome results on davinci-003. Any resolution to this apart from moving back to davinci-002?

P.S. Other tools working fine on davinci-003 however, being generating a longer output, the Article writer tool is experiencing Timeout.

error- Failed to load resource: the server responded with a status of 524 ()

danrasmuson · December 28, 2022, 5:04pm

Presently I’m receiving ~23.968s response times on text-davinci-003 create-completion. Does the OpenAI team have a response time SLA on this endpoint?

I’m hoping for an update on the status page for these large response times as it really impacts the applications we are building.

iCopy · January 16, 2023, 10:25pm

Is anyone experiencing a much longer response time to your app? We have an app and the response time has gotten worse. Interestingly enough some of our competitor’s response time is faster. Any do’s and don’t to minimize the response time?

alex.batista · January 20, 2023, 4:30pm

@logankilpatrick and his team added a section in the documentation regarding this subject. The are good tips about it:

Thanks again @logankilpatrick

Topic		Replies	Views
API calls to davinci text 3 very slow and random speeds for identical prompts API	27	6933	December 25, 2023
Slow Chat api responses ------ API	17	6442	December 24, 2023
How to reduce OpenAI response time? API	13	17650	December 13, 2023
API Response time seem to have gone up significantly API	2	693	December 25, 2023
How can I improve response times from the OpenAI API while generating responses based on our knowledge base? API chatgpt , api	3	22128	November 9, 2023

Davinci-text-003 Response Times

Related topics