GPT4 Streaming doesn't use all information from retrieved document

johananleem · September 6, 2023, 5:39am

Hi,
I am using a Retrieval Augmented Generation pattern as a QnA solution. When a query is sent, documents are retrieved from an Azure Cognitive Search index. The retrieved documents are the streamed into OpenAI GPT-4 using POST method.
The generated text is then streamed out and sent to my app.

Occasionally, when the text in the document is large, GPT-4 would respond that it has no information on the topic. This is especially the case if queries that ask about the later half of a document or if multiple documents are needed to make a summary.
I’m not entirely sure of the cause, but I suspect that GPT-4 is generating text as it is being streamed and tries to conclude the output text before it reaches to the relevant parts.

Are there any fixes to this? And what are the limits of streaming to GPT-4 that I should be aware of?
I am using Python 3.10 and my GPT- model is accessed using Azure OpenAI.

_j · September 6, 2023, 7:18am

That doesn’t make any sense. The AI must be passed the full documentation and instructions of how to act on it in a format it can understand. Only when the input context is loaded can the AI then form a coherent answer.

BrianLovesAI · September 8, 2023, 4:47am

I think that would not happen normally. I saw hallucination issues, but not like that, saying no document found. In your case, it raises suspicion that the document wasn’t properly searched for. The best approach would be to log the input value just before sending a request to GPT-4 for verification.

I’m currently using both stream true/false and processing a lot of queries. However, I haven’t come across this issue unless the document was not provided properly.

Topic		Replies	Views
Streaming Response Keeps on Breaking API gpt-4	4	1231	July 3, 2024
Is it possible to stream large text as input to the ChatGPT API? API	1	2547	March 19, 2023
GPT-4 Streaming Output Radically Different than Static Output API	4	1630	October 9, 2023
GPT-4 model, unexpected returns in stream mode API gpt-4 , api	10	3347	December 16, 2023
In GPT4 streamed responses all chunks come in a single batch API streaming	4	3343	June 7, 2024

GPT4 Streaming doesn't use all information from retrieved document

Related topics