Malformed streaming answers from the GPT-4 chat completions API lately

Since this morning, I've been noticing that the chat completions API intermittently replies with malformed answers. Is anyone else seeing the same thing? Here is an example:

Chunk 1:

[
'{"id":"chatcmpl-8IVTD4XbqF5boxUIVcTJERVy075ww","object":"chat.completion.chunk","created":1699421567,"model":"g'
]

Chunk 2:

[
'pt-4-0613","choices":[{"index":0,"delta":{"content":"In"},"finish_reason":null}]}'
]

Normally, I parse each of these chunks as JSON to extract "choices", but these chunks cannot be parsed now. This is the first time I've seen this happen. I'm guessing it might be related to the outage happening right now?
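To illustrate the failure mode with a minimal, self-contained sketch (the strings here are shortened stand-ins for the chunks above, not the real payloads): neither chunk is valid JSON on its own, but their concatenation is.

```typescript
// Shortened stand-ins for the two chunks above: one JSON object
// split mid-string across two network chunks.
const chunk1 = '{"id":"chatcmpl-abc","object":"chat.completion.chunk","model":"g';
const chunk2 = 'pt-4-0613","choices":[{"index":0,"delta":{"content":"In"}}]}';

// Returns true when the string is valid JSON on its own.
function parses(s: string): boolean {
  try {
    JSON.parse(s);
    return true;
  } catch {
    return false;
  }
}

// Neither chunk parses alone; the concatenation does, so calling
// JSON.parse per chunk breaks as soon as a payload straddles two chunks.
```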

Same here. My API calls return gibberish text. However, the web application chat interface works fine.

We face the same issue. Is this here to stay, i.e. should I implement a fix in my code, or should we wait for a fix from the OpenAI team?

I can confirm. This probably started within the last 24h; all of our previously working code (working for months) has started having problems with the first couple of chunks.

We are talking about streaming mode here, so:

	const chatResponse = await openai.createChatCompletion({
		...options,
		messages,
		stream: true
	}, {
		responseType: 'stream'
	})

and then handling the stream (as per lots of example code - this has been working for a long time):

	chatStream.on('data', (chunk: Buffer) => {
		const messages = chunk
			.toString('utf8')
			.split('\n')
			.filter(m => m.length > 0)
		for (const message of messages) {
			// ... skipped code
			const data = message.slice(dataMarker.length)
			// ... skipped code
			const completion: CreateChatCompletionStreamResponse = JSON.parse(data)
			// ... etc.

suddenly, the first chunk has a newline in it and results in two “messages”:

data: {"id":"chatcmpl-8J04wP33MFVY0pQoljcFiBsiBW3uC","object":"chat.completion.chunk","created":1699539226,"model":"gpt-4-0613","choices":[{"index":0,"delta":{"role":"assistant","content":""},"finish_reason":null}]}

data: {"id":"chatcmpl-8J04wP33MFVY0pQoljcFiBsiBW3uC","object":"chat.completion.chunk","created":1699539226,"model":"gpt-4-0613"

The first parses fine (its delta content is just ""); the second fails to parse, as it is not valid JSON (it's only half of the object).

then the next chunk continues:

,"choices":[{"index":0,"delta":{"content":"My"},"finish_reason":null}]}

which does not start with the data marker 'data: ', so it is ignored by code like How to use stream: true? · Issue #18 · openai/openai-node · GitHub, which is pretty much the code we have been running for months.

The next and subsequent chunks come in as usual, e.g.:

data: {"id":"chatcmpl-8J04wP33MFVY0pQoljcFiBsiBW3uC","object":"chat.completion.chunk","created":1699539226,"model":"gpt-4-0613","choices":[{"index":0,"delta":{"content":" journey"},"finish_reason":null}]}

It's easy enough to mitigate, but why has this happened? Will it be reverted? One other issue has been opened about it: createChatCompletion Regression - payload is split across 2 chunks


The issue has not been entirely resolved; the malformed streaming answers still persist.


Is there any solution to this? I'm also experiencing it today.

The issue resolved itself for me. Working correctly again.


This exact symptom hit a few weeks ago, and the answer then was: "you should wait for complete objects and fix them, it's within the protocol rules".

Whatever was causing it was temporary and seems to be resolved now.

I think the point that SSE-handling code needs to buffer until it has complete JSON objects is correct. I would imagine, though, that most people (myself included) are running code much like the example code OpenAI themselves provided, which does not do any such buffering.
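For what it's worth, a minimal sketch of such buffering (`SseLineBuffer` is an illustrative name, not from any library): keep any trailing partial line in a buffer until a later chunk completes it, and only hand complete lines to the JSON parser.

```typescript
// Minimal line-buffering sketch for SSE-style 'data: ...' streams.
// SseLineBuffer is an illustrative name, not part of any library.
class SseLineBuffer {
  private buffer = '';

  // Feed one network chunk; returns only the lines completed so far.
  // Any trailing partial line is kept until a later chunk finishes it.
  push(chunk: string): string[] {
    this.buffer += chunk;
    const lines = this.buffer.split('\n');
    this.buffer = lines.pop() ?? ''; // last element may be incomplete
    return lines.filter(line => line.length > 0);
  }
}

// Usage: feed chunks as they arrive; JSON.parse only complete lines.
const buf = new SseLineBuffer();
const first = buf.push('data: {"a":1}\n\ndata: {"b"'); // ['data: {"a":1}']
const second = buf.push(':2}\n');                      // ['data: {"b":2}']
```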

I think the best thing to do is to use the latest API, which exposes the stream result as an async iterable, so you don't have to worry about SSE responses or JSON parsing yourself… (that's assuming the npm package handles this case correctly itself…)
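As a sketch of what that looks like (`collectStream` and `mockStream` are hypothetical helpers I've made up for illustration; the commented-out call shows roughly where the real v4 client would plug in):

```typescript
// Minimal shape of a streamed chunk; only the fields used here.
interface StreamChunk {
  choices: { delta: { content?: string } }[];
}

// Hypothetical helper: concatenates delta content from any async
// iterable of chunk-shaped objects, the way the v4 SDK yields them.
async function collectStream(stream: AsyncIterable<StreamChunk>): Promise<string> {
  let text = '';
  for await (const chunk of stream) {
    text += chunk.choices[0]?.delta?.content ?? '';
  }
  return text;
}

// With the real client this would look roughly like:
//   const stream = await openai.chat.completions.create({ model, messages, stream: true });
//   const answer = await collectStream(stream);

// Mock stream standing in for the API, for illustration only.
async function* mockStream(): AsyncIterable<StreamChunk> {
  yield { choices: [{ delta: { content: 'My' } }] };
  yield { choices: [{ delta: { content: ' journey' } }] };
}
```

The point is that the SDK does the SSE framing and JSON parsing internally, so your code only ever sees complete, already-parsed chunk objects.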

Yes, that's right. I just adopted the openai npm package.