GPT (gpt4) API works much slower than Playground

daiki.ichikawa · August 11, 2023, 7:38am

Hello, I have a question.

I’m testing the GPT API with the OpenAI Playground.
I use “chat” mode with the “gpt-4” model and ask some questions.
It works just like ChatGPT: the response is quick (usually a few seconds before it starts typing).

However, when I call the GPT API from my local environment with the same setup (“chat” mode with the “gpt-4” model) and ask the same question I asked in the Playground, the response is very slow (about 5 to 10 times slower than in the Playground).

Why does this happen, and are there any solutions for it?

The source code is below. It’s copied from Playground.
I tried setting the stream parameter to true, but it did not help.

// openai Node.js SDK v3 (Configuration / OpenAIApi interface)
import { Configuration, OpenAIApi } from "openai";

const configuration = new Configuration({
  apiKey: process.env.OPENAI_API_KEY,
});
const openai = new OpenAIApi(configuration);

// Non-streaming call: resolves only after the whole completion has been generated
const response = await openai.createChatCompletion({
  model: "gpt-4",
  messages: [
    {
      role: "user",
      content: "Create a comparison table for Toyota, Tesla, and Honda in markdown format."
    }
  ],
  temperature: 1,
  max_tokens: 256,
  top_p: 1,
  frequency_penalty: 0,
  presence_penalty: 0,
  // stream: true
});

Thanks,

Foxalabs · August 11, 2023, 7:46am

Hi,

In the Playground you are using the super fast GPT-3.5-Turbo model, so it’s quick. In your code you are calling the slower and more powerful GPT-4 model, so you would expect a significant difference between the two.
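
It may also be worth separating two numbers when comparing. The Playground streams its output, so “a few seconds before it starts typing” is the time to the first token, while a non-streaming API call only returns once the whole completion is finished. Below is a rough sketch for timing the two separately. It is not from the original post: measureStream and fakeStream are made-up names, and the fake stream only stands in for a real streamed response.

```javascript
// Hypothetical helper: consumes any async iterable of text chunks and records
// when the first chunk arrived versus when the stream finished.
async function measureStream(chunks, now = Date.now) {
  const start = now();
  let firstChunkMs = null; // stays null if the stream yields nothing
  let text = "";
  for await (const chunk of chunks) {
    if (firstChunkMs === null) {
      firstChunkMs = now() - start;
    }
    text += chunk;
  }
  return { firstChunkMs, totalMs: now() - start, text };
}

// Stand-in for a streamed completion (an assumption for illustration only):
// yields each piece after a fixed delay.
async function* fakeStream(pieces, delayMs) {
  for (const piece of pieces) {
    await new Promise((resolve) => setTimeout(resolve, delayMs));
    yield piece;
  }
}

// Usage sketch: with a real client, pass its stream of text chunks instead.
async function main() {
  const result = await measureStream(
    fakeStream(["| Brand ", "| Country ", "|"], 20)
  );
  console.log(result.text);
  console.log(`first chunk: ${result.firstChunkMs} ms, total: ${result.totalMs} ms`);
}

main();
```

If the time to the first chunk is similar in both places and only the total differs, the gap is mostly about waiting for the full response rather than the API itself being slower.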
