Curl api.openai.com is slow, 15-30s/req

nguyenanhdon.qn · March 8, 2023, 1:45am

My code example

    $url = 'https://api.openai.com/v1/chat/completions';
    $model = 'gpt-3.5-turbo';
    $header = array(
        'Authorization: Bearer ' . $API_KEY,
        'Content-Type: application/json',
    );

    // $data holds the array of chat messages sent to the API.
    $params = json_encode(array(
        'messages'          => $data,
        'model'             => $model,
        'temperature'       => 1,
        'max_tokens'        => 1500,
        'top_p'             => 1,
        'frequency_penalty' => 0,
        'presence_penalty'  => 0,
    ));

    $curl = curl_init($url);
    $options = array(
        CURLOPT_POST           => true,
        CURLOPT_HTTPHEADER     => $header,
        CURLOPT_POSTFIELDS     => $params,
        CURLOPT_RETURNTRANSFER => true,
        // Note: the next two options disable TLS certificate checks.
        CURLOPT_SSL_VERIFYPEER => false,
        CURLOPT_SSL_VERIFYHOST => false,
        CURLOPT_AUTOREFERER    => true,
    );
    curl_setopt_array($curl, $options);
    $response = curl_exec($curl);

The response time is very slow: 15-30 s per request.

nguyenanhdon.qn · March 8, 2023, 2:03am

Update 08/03/2023: 8-9s/req

nguyenanhdon.qn · March 8, 2023, 10:13am

There is a demo at https://chat.chatgptvietnam.org where you can test the request speed.

benjieperez28 · March 8, 2023, 1:40pm

Hi, we are also experiencing this.

“cURL error 6: Could not resolve host: api.openai.com (see https://curl.haxx.se/libcurl/c/libcurl-errors.html) for https://api.openai.com/v1/chat/completions”
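
cURL error 6 means the host name could not be resolved, which points at DNS on the calling server rather than at the API's processing time. A minimal Python sketch for checking how long name resolution takes from the affected machine; the helper name `time_dns_lookup` is made up for illustration and is not part of any library:

```python
import socket
import time


def time_dns_lookup(host, port=443):
    """Resolve host and return (ok, seconds, detail).

    detail is the sorted list of resolved addresses on success,
    or the error message on failure.
    """
    start = time.perf_counter()
    try:
        infos = socket.getaddrinfo(host, port, proto=socket.IPPROTO_TCP)
    except OSError as exc:  # socket.gaierror is a subclass of OSError
        return False, time.perf_counter() - start, str(exc)
    elapsed = time.perf_counter() - start
    # Each entry is (family, type, proto, canonname, sockaddr); sockaddr[0] is the IP.
    addresses = sorted({info[4][0] for info in infos})
    return True, elapsed, addresses
```

Calling `time_dns_lookup("api.openai.com")` on the server that shows the error returns whether the lookup worked and how long it took. A failed or multi-second lookup would suggest looking at the server's resolver configuration before blaming the API.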

nguyenanhdon.qn · March 8, 2023, 2:10pm

Are you in one of the countries where ping and curl requests to api.openai.com fail?

ajondo · March 8, 2023, 5:58pm

I do curl with PHP. The problem is not that the site is slow - it only takes the API a very long time to finish. So, if you are not using the stream feature, you have to wait until the end. That can result even in server timeouts. In that case, increase your time limits in the php.ini and the timeout in your Apache server settings.

Something like this could help:
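
    // Stream the response: cURL calls this function for each chunk it receives.
    // The request body must also set 'stream' => true so the API sends partial results.
    curl_setopt($ch, CURLOPT_WRITEFUNCTION, function ($curl, $chunk) {
        echo $chunk;            // forward the received data to the client
        flush();                // push it out right away instead of buffering
        return strlen($chunk);  // cURL expects the number of bytes handled
    });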

benjieperez28 · March 9, 2023, 12:14am

Yes, our servers are in the Philippines, and it is really slow when it comes to prompting.

benjieperez28 · March 13, 2023, 6:50am

nguyenanhdon.qn:

Very slow response time from 15-30s

Is there any solution for this?

raymonddavey · March 13, 2023, 8:01am

You will have to do a lot more work - but by setting stream to true, you can give the user feedback while it is responding

It will look like the AI is typing on the screen
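
Part of that extra work is parsing the stream. With `stream` set to true, the chat completions endpoint sends server-sent events: lines of the form `data: {json}`, where each JSON chunk carries a piece of text under `choices[0].delta.content`, followed by a final `data: [DONE]` line. A minimal Python sketch of the parsing step, assuming the input has already been split into complete lines; the function name `iter_stream_text` and the sample lines are made up for illustration:

```python
import json


def iter_stream_text(lines):
    """Yield text fragments from an iterable of server-sent-event lines."""
    for raw in lines:
        line = raw.decode("utf-8") if isinstance(raw, bytes) else raw
        line = line.strip()
        if not line.startswith("data:"):
            continue  # skip blank separator lines and other fields
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break  # end-of-stream marker
        chunk = json.loads(payload)
        delta = chunk["choices"][0].get("delta", {})
        text = delta.get("content")
        if text:
            yield text


# Hand-written sample lines in the shape the streaming endpoint sends.
sample = [
    'data: {"choices": [{"delta": {"role": "assistant"}}]}',
    '',
    'data: {"choices": [{"delta": {"content": "Ahoy"}}]}',
    'data: {"choices": [{"delta": {"content": ", matey!"}}]}',
    'data: {"choices": [{"delta": {}, "finish_reason": "stop"}]}',
    'data: [DONE]',
]
print("".join(iter_stream_text(sample)))  # prints "Ahoy, matey!"
```

In a real client, each fragment would be written to the page as soon as it is yielded, which is what produces the typing effect. Raw network chunks can end in the middle of a line, so a real client needs to buffer until it has a full line before calling this.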

nguyenanhdon.qn · March 13, 2023, 8:52am

Does CURLOPT_WRITEFUNCTION actually stream the data instead of waiting for the full response to buffer?

benjieperez28 · March 14, 2023, 8:05am

I hope they fix this issue. It is not suitable for production use if this latency persists.

ruby_coder · March 14, 2023, 8:31am

I think people often confuse network delays, data center congestion, and the like with API performance.

For example, I am 12 time zones away from the US and call the OpenAI completion API, and here are the results when I time the call:

`text-davinci-003`

Test 1:  Completions.get_reply Time: 1.247792 secs
Test 2:  Completions.get_reply Time: 5.038783 secs
Test 3:  Completions.get_reply Time: 1.289555 secs
Test 4:  Completions.get_reply Time: 2.205132 secs

Kindly keep in mind that I am testing the OpenAI APIs from the opposite side of the world from the US.

Also, if I repeat for other models, the results are similar. It’s mostly network traffic issues, not model issues, from my experience.

Having said that, lately I have noticed that text-davinci-002 is about 0.5 seconds faster than text-davinci-003 (for the same prompt), but did not test extensively.
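
For anyone who wants to collect comparable numbers, here is a minimal Python sketch of timing repeated calls. It is not the test used above, which is not shown in this thread; `time_calls`, `summarize`, and `fake_request` are hypothetical names, and `fake_request` is a stand-in to be replaced with the real API request:

```python
import statistics
import time


def time_calls(func, runs=4):
    """Call func() `runs` times and return the elapsed seconds of each call."""
    timings = []
    for _ in range(runs):
        start = time.perf_counter()
        func()
        timings.append(time.perf_counter() - start)
    return timings


def summarize(timings):
    """Return (minimum, median, maximum) of a list of timings."""
    return min(timings), statistics.median(timings), max(timings)


# Stand-in for a real API request; replace with the call being measured.
def fake_request():
    time.sleep(0.01)


results = time_calls(fake_request, runs=4)
for i, seconds in enumerate(results, start=1):
    print(f"Test {i}: {seconds:.6f} secs")
print("min/median/max:", summarize(results))
```

Several runs matter because a single slow call, like Test 2 above, can come from a transient network hiccup. The median is less affected by such an outlier than the average is.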

Appendix: Example Test

nguyenanhdon.qn · March 14, 2023, 10:00am

Thank you

`text-davinci-003`

I used text-davinci-003 before, and the average response time was 0.5-0.8 s, but its price is high.
I have to use gpt-3.5-turbo-0301 or gpt-3.5-turbo to reduce the cost, but those models have slow processing times.

ruby_coder · March 14, 2023, 10:07am

Will test turbo for you when back at my desk.

nguyenanhdon.qn · March 14, 2023, 10:09am

Thank you. I am spending $30 a day on gpt-3.5-turbo; the same usage would cost $300 with text-davinci-003, LOL.

ruby_coder · March 14, 2023, 10:29am

Here are some test results (just now) for turbo:

`gpt-3.5-turbo-0301`

Test 1. Completion API Time: 1.529 seconds
Test 2. Completion API Time: 2.504 seconds
Test 3. Completion API Time: 1.557 seconds
Test 4. Completion API Time: 1.513 seconds
Test 5. Completion API Time: 1.505 seconds

Appendix: Sample Chat Completion with Time

HTH

nguyenanhdon.qn · March 15, 2023, 1:56am

Unexpectedly, you can see the query sample

nguyenanhdon.qn · March 15, 2023, 4:05am

Py test: 12s

import openai

openai.api_key = "sk-..."

completion = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "Tell the world about the ChatGPT API in the style of a pirate."}]
)

print(completion)

ruby_coder · March 15, 2023, 4:07am

Same results from here at this time:

nguyenanhdon.qn · March 15, 2023, 4:10am

Your software is very good. Can you give me some information about it? I just tried with Python and it took 12 s.
