Out-of-context questions in retrieval-augmented generation

andrasaponyi · October 16, 2023, 11:56am

Given a chatbot based on gpt-3.5-turbo in retrieval-augmented generation setting, where the model is asked to answer questions based on a provided context, what are some ways to stop it from answering questions or otherwise fulfilling requests that are not related to the context?

For example, suppose that this bot was deployed on a website in the medical domain. It should not be possible to ask it to create recommendations for how to write a good history essay, for example.

I’ve tried including in the prompt an instruction that the model should only answer questions that are related to the context and while this seems to work most of the time in English, it no longer does so in other languages I’ve experimented with. Another idea I’ve played around with is including an additional step where the model is asked to determine whether the question is answerable - this works somewhat better, but generates too many false negatives.

mvng.sl.001 · October 16, 2023, 12:53pm

Adding the following to your prompt either at the top where you might be setting system/role for LLM; or towards the end along with the question may help.

“You can only make conversations based on the provided context. If a response cannot be formed strictly using the context, politely say you don’t have knowledge about that topic.”

Bonadio · October 19, 2023, 11:37pm

Hi @andrasaponyi

I have the same problem, i found very difficult to force the model to only use the context. Here is my prompt

“”"

Please read the context provided below:
CONTEXT

{context_str}

Based solely on the information given in the context above, answer the following question. If the information isn’t available in the context to formulate an answer, simply reply with ‘NO_ANSWER’. Please do not provide additional explanations or information.

Question: {query_str}“”"

So far I found that:
1- gpt-3.5-turbo is very hard to only stay with the context
2- gpt-4 works well but is expensive
3- Google Vertex AI text-bison seems to work very well and the price is like gpt-3.5-turbo, the problem is that the responses seems to be shorter not so gentle.

andrasaponyi · October 20, 2023, 8:38am

Having the same exact experiences myself. Good to know that I’m not the only one struggling

Topic		Replies	Views
How to prevent ChatGPT-4 from answering questions that are outside our Context API gpt-4 , hallucinations , prompt-engineering	3	1192	August 16, 2024
GPT-4 keeps lying instead of saying "I don't know"! Prompting gpt-4 , hallucinations	6	6769	December 19, 2023
Forcing use of context information and suppressing everything else Prompting chatgpt	7	3815	December 19, 2023
How to prevent GPT-3.5 from referencing knowledge from its training and only use given context? API gpt-4 , gpt-35-turbo , chatgpt , api	4	1906	December 20, 2023
Changing prompts to remove references to context Prompting	11	8813	September 22, 2023

Out-of-context questions in retrieval-augmented generation

Please read the context provided below: CONTEXT

{context_str}

Related topics

Please read the context provided below:
CONTEXT