I’m about to deploy a company RAG system aimed at potential customers. However, I’m realising now that I couldn’t quite stop the bot from falling into a big misconception: even though I explain in the system message that this is not the case, it answers questions as if the retrieved documents were exhaustive with respect to the question asked. For instance, if I ask the bot how many new projects were started in my company last year, it will only count those described in the retrieved documents, even though, again, I instruct the bot to consider that there may be more information than what was retrieved, and that it should suggest the user write us an email rather than give a definitive answer.
This is puzzling me quite a bit, because I don’t want the bot to give the impression that we do only a fraction of our actual activities. Has anyone faced this problem before? How did you solve it?
I understand your struggle. Just a couple of quick questions:
Why not give the bot an API to query the number of projects and get the exact result? That way it is the database, not the bot, that provides the definitive answer - see the tool-calling sketch after these questions.
If you prefer that the bot ask the user to send an email for this type of information, does the bot have clear instructions about what kind of information is answerable and what kind needs to be provided by a human via email or other means?
If you consider that the bot should not answer this type of question at all, why haven’t you implemented filter logic to keep such questions from reaching the bot in the first place? (After all, the bot’s nature pushes it to answer whatever the user asks.) There’s a filter sketch below as well.
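On the first point, here is a minimal sketch of such a tool, assuming a SQLite database with a hypothetical `projects` table and a `started_year` column - all names are placeholders for whatever your stack actually uses:

```python
import sqlite3

DB_PATH = "company.db"  # placeholder path to your projects database

def count_projects(year: int) -> int:
    """Exact count straight from the database, not from retrieved docs."""
    with sqlite3.connect(DB_PATH) as conn:
        row = conn.execute(
            "SELECT COUNT(*) FROM projects WHERE started_year = ?",
            (year,),
        ).fetchone()
    return row[0]

# OpenAI-style function definition; adapt to your framework's tool format.
COUNT_PROJECTS_TOOL = {
    "type": "function",
    "function": {
        "name": "count_projects",
        "description": "Return the exact number of projects started in a given year.",
        "parameters": {
            "type": "object",
            "properties": {"year": {"type": "integer"}},
            "required": ["year"],
        },
    },
}
```

When the model decides to call `count_projects`, you execute it and feed the result back, so the count no longer depends on what the retriever happened to surface.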
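And on the filter idea: it doesn’t need to be fancy. A keyword router like the one below (the patterns and the email address are made up) already catches the obvious aggregate questions before they ever reach the bot:

```python
import re

# Made-up patterns for questions that need an exhaustive view of the data.
AGGREGATE_PATTERNS = [
    r"\bhow many\b",
    r"\btotal number of\b",
    r"\ball (of )?(your|the) projects\b",
]

EMAIL_FALLBACK = (
    "Answering that would require a complete view of our records. "
    "Please write to us at info@example.com and we'll follow up."  # placeholder address
)

def route_question(question: str) -> str | None:
    """Return a canned reply for aggregate questions, or None to pass
    the question on to the RAG bot."""
    if any(re.search(p, question.lower()) for p in AGGREGATE_PATTERNS):
        return EMAIL_FALLBACK
    return None
```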
Thank you both for your answers. I managed to keep the bot from giving definitive answers based on incomplete information just by tweaking the prompt to stress that directive more. It wouldn’t comply until I repeated the instruction in all caps, but now it does.
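For anyone hitting the same wall, the directive that finally stuck was along these lines (paraphrased and anonymised, not my literal prompt):

```python
# Paraphrase of the directive that worked; the wording is illustrative only.
SYSTEM_PROMPT_EXCERPT = """\
The retrieved documents are a SAMPLE of our records, NOT a complete list.
NEVER present counts, totals, or 'how many' answers based on them as
definitive. If a question needs a complete view of our records, suggest
the user email us instead of answering."""
```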
Unfortunately, my project doesn’t have the budget for most of the solutions you suggested, and in any case we don’t really expect users to ask questions that require an exhaustive scan of the database - my example was more of an edge case than anything - so I think we are good now.