Preventing assistant from mentioning files

owen1 · April 3, 2025, 11:54am

I am using the assistants api with file retrieval and code interpreter tools. The assistant will frequently mention the files by name. I want it to act like it can just see the data and not explicitly reference the files, or talk about re-importing the files etc. I know this has been mentioned on the forum before but there doesn’t seem to have been any working solution, and wondering if any one has found a solution since.

To be clear, I don’t mind the annotations to files that are sometimes added to the response, those are easy to deal with. What I am talking abut is when the assistant explicitly references the files in the text as in: “It seems that the session reset has caused a loss of import statements. Let me correct that and re-import necessary modules to process the files.”

I have tried numerous prompts along the lines of "don’t mention the files, the user should be unaware of the existence of the files. " to absolutely no avail.

I know that I could run the response through another LLM to rewrite the response in a way that doesn’t mention the files, however I am using the streaming API as well, so this isn’t really an option.

_j · April 3, 2025, 12:08pm

It’s pretty much impossible to stop the AI from producing irrelevant recitations about files, or to stop free reign over them from an unprivileged user, even those files that are meant to be internal, when OpenAI has damaged Assistants and Responses in this cleverly droll manner:

And you get these abominations.

owen1 · April 3, 2025, 12:19pm

thanks @_j . That’s disappointing . I can add guardrails I suppose to stop the user asking questions like that.

Three questions for whoever knows them:

Are there any prompts that anyone has found that significantly minimise the chance of the agent mentioning files.
Are there any updates to the API in the pipeline that might “fix” this? …before I completely abandon this API.
Can this behaviour be changed using some other framework like LLamaindex that abstracts the assistant API?

_j · April 3, 2025, 1:23pm

The most straightforward way is to make direct use of vector stores - and not even have a search tool, just use input context RAG injection based on the context of what is being asked and some rewriting.

A new guide just for you:

Of course better is to use embeddings yourself - but also a higher hurdle than a service provided by the same provider as you language model.

Topic		Replies	Views
How to get assistants api to not refer to files or documents Prompting assistants , assistants-api	9	2569	November 1, 2024
Avoid explicit mention to retrieval and assistant files Prompting rag , assistants-api , knowledge-files	8	2216	April 10, 2025
Assistant referring to "the files uploaded" in the vector store Prompting assistants-api	6	340	April 4, 2025
Assistants inconsistency with attached vector store API api , assistants-api	1	197	December 16, 2024
Responses API file_search tool - issues and guidance API rag , file-uploads , file-search , responses-endpoint , responses	4	228	April 5, 2025

Preventing assistant from mentioning files

Related topics