High rate of invalid JSON response when streaming response

georg-san · June 27, 2023, 1:30am

I’m using NodeJS to call createChatCompletion and stream the response, like so:

 async function openAiCall(maxResponseTokens, messages, modelName) {
      const response = await openai.createChatCompletion(
        {
          temperature: 0.7,
          max_tokens: maxResponseTokens,
          top_p: 1,
          frequency_penalty: 0,
          presence_penalty: 0,
          user: uid,
          model: modelName,
          stream: true,
          messages,
        },
        { responseType: "stream" }
      );

      return response;
    }

I then listen to each the response like so:

...
 response.data.on("data", async (data) => {
          const lines = data.toString().split("\n");

          for (const line of lines) {
            const message = line.replace(/^data: /, "");
           
            if (message === "[DONE]")  return;
            
            if (!message) continue;

            let text = "";

            try {
              const parsed = JSON.parse(message);
              if (parsed) text = parsed.choices[0].delta?.content;
            
 // Message can't be parsed
...

While the function and the response work correctly, the volume of invalid JSON responses for each line has increased a lot. It’s especially high volume with these models: gpt-3.5-turbo-0613, gpt-3.5-turbo-16k-0613 and gpt-3.5-turbo.

This used to happen less frequently (~ once for every 1k lines) but not happens ~50-100 for 1k lines.

Anyone else experiencing this? Is there anything that can be done to mitigate this issue?

AiMachineDream · June 27, 2023, 2:36am

You should have 0 invalid responses. Something in your streaming code is off.

georg-san · June 27, 2023, 2:41am

Some lines when looping through for (const line of lines) { are incomplete (missing a chunk at the beginning or end. Here an example ,"created":1687829221,"model":"gpt-3.5-turbo-16k-0613","choices":[{"index":0,"delta":{"content":" of"},"finish_reason":null}]}

novaphil · June 27, 2023, 1:16pm

Suggested code from OpenAI here uses:

const lines = data.toString().split('\n').filter(line => line.trim() !== '');

There’s also a wide variety of other npm packages and code snippets in that GitHub issue.

lipzai0625 · July 10, 2023, 3:31am

I’m using the API directly and facing this issue as well.

As you can see from the image below, the json from first chunk is not completed yet and it continue on the second data chunk. Causing invalid json format issue.

@georg-san Did u managed to find some workaround?

lipzai0625 · July 10, 2023, 3:39am

Think even OpenAI Playground is facing this issue.

anon10827405 · July 10, 2023, 4:31am

Use the V4 beta for streaming

github.com

openai/openai-node/blob/v4/examples/demo.ts

#!/usr/bin/env yarn tsn -T

import OpenAI from 'openai';

// gets API Key from environment variable OPENAI_API_KEY
const client = new OpenAI();

async function main() {
  // Non-streaming:
  const result = await client.completions.create({
    prompt: 'Say this is a test',
    model: 'text-davinci-003',
  });
  console.log(result.choices[0]!.text);

  // Streaming:
  const stream = await client.completions.create({
    prompt: 'Say this is a test',
    model: 'text-davinci-003',
    stream: true,

This file has been truncated. show original

lipzai0625 · July 11, 2023, 2:13am

@anon10827405 Thanks for sharing, gonna try now!

baresi687 · August 14, 2023, 10:14am

I’m not using the OpenAI package but I had the same problem.
Sometimes the chunks are incomplete and will continue on the next loop, giving parse errors.

I solved this in my case by detecting when the chunks do not end with completed objects = ‘}]}’
Then storing that line in variable and removing it from lines array.
And on the next loop add it to the first index of lines.
Ex. lines[0] = inCompleteChunk + lines[0]

Topic		Replies	Views
Malformed streaming answers from GPT-4 completions API lately API	11	2301	November 13, 2023
Parsing JSON stream response in nodejs API api , json	7	18010	August 3, 2024
GPT-4 model, unexpected returns in stream mode API gpt-4 , api	10	3316	December 16, 2023
Data on completions stream response is cut off in the middle API api , api-streaming	1	330	July 8, 2024
Was there an intentional change to the streaming responses? (multiple chunks in stream event) API bug , api , streaming	9	2595	July 23, 2024

High rate of invalid JSON response when streaming response

Related topics