Limit knowledge retrieval to only pull from a list

pgartenburg · December 15, 2023, 8:53am

Hi,
I’m hoping someone has some insight here to help me out.

I’m trying to build an assistant that reads a news article that I provide it as a file. The assistant’s job is to categorize the article into appropriate keywords, but I want those keywords to be constrained to a list of keywords that I have provided in another file. The assistant responds with keywords that are relevant to the article, but it comes up with its own that are outside of my list.

tldr:
Assistant has 2 files:

List of keywords.txt
News article.txt

I want it to respond with the keywords that pertain to the article but I want it to only pick from the list of keywords provided in the file.

Anyone have any tips for me here?
I know this will never be perfect but so far it isn’t even close to listening to my instructions.

I know I could improve this by passing the list of keywords into the system message, but I have to run this on thousands of articles daily and I want to keep costs down.

Thanks in advance.

jr.2509 · December 15, 2023, 11:33am

Hi - are you willing to share your instructions? I could take a look to see if I can identify areas for optimization.

I have created an assistant that also must apply categorization as input for a SQL statement. In my case, I use a function call to create a JSON with the various categories. After lots of testing, what I found works is to list all the options to choose from in the function call itself, in the description for the individual properties. For some of the properties I have multiple hundreds of choices… This approach has achieved the best results, as the assistant will strictly only draw on the choices provided.
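A minimal sketch of what such a function definition could look like, for readers who want something concrete. This is not the actual function from this post: the function name `record_keywords`, the `keywords` property, and the sample keywords are all hypothetical, and the `enum` constraint is an extra assumption on top of the description text mentioned above.

```python
import json


def build_keyword_tool(allowed_keywords):
    """Build a function-tool definition whose single argument is a list
    of keywords drawn from allowed_keywords.

    The name "record_keywords" and the property "keywords" are
    hypothetical, chosen only for this illustration.
    """
    allowed = list(allowed_keywords)
    return {
        "type": "function",
        "function": {
            "name": "record_keywords",
            "description": "Record the keywords that apply to the article.",
            "parameters": {
                "type": "object",
                "properties": {
                    "keywords": {
                        "type": "array",
                        # The options are spelled out in the property
                        # description, as described in the post above.
                        "description": "Choose only from: " + ", ".join(allowed),
                        # Assumption: an enum is added as a second,
                        # machine-readable statement of the same list.
                        "items": {"type": "string", "enum": allowed},
                    }
                },
                "required": ["keywords"],
            },
        },
    }


if __name__ == "__main__":
    tool = build_keyword_tool(["politics", "sports", "technology"])
    print(json.dumps(tool, indent=2))
```

The resulting dictionary would be passed in the assistant's list of tools. Note that the keyword list still counts toward input tokens on every run, so this trades cost for stricter adherence.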

However, based on my own experience you may be able to get close by using files provided the instructions are specific enough.

pgartenburg · December 16, 2023, 9:14am

Hi @jr.2509, thanks so much for your offer to help, and sorry for the delay in getting back to you. I can definitely share what I’m trying to do. Nothing so far has been that complex in terms of my prompting.

I’ve tried a few different ways so far:

List of keywords in a text file attached to the assistant and article attached to the user message
Both files attached to the assistant
Iterations on the two above with different prompts

Here is one of the prompts I have been trying:
You are provided with an article as an attached file. You match keywords to the provided article. Your job is to provide a list of keywords chosen from the attached list.

You will respond with keywords only found in the list provided. Do not respond with any keywords that aren’t in the provided list.

And here is another attempt
You match keywords to a provided article. You have been provided with a json file that has a list of parent keywords which are more general keywords and each has a list of associated sub keywords. Your job is to provide a list of keywords chosen from the attached list. It doesn’t matter whether the appropriate keywords are parent keywords or child keywords, just return a list of all keywords that apply
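Independent of the prompt wording, the reply can also be checked in code after the fact, so that anything outside the list is dropped before it is stored. The sketch below is only an illustration under assumptions: the real layout of the JSON file is not shown in this thread, so it assumes a mapping of each parent keyword to a list of its sub keywords, and the helper names are hypothetical.

```python
def flatten_keywords(tree):
    """Collect every parent and sub keyword into one set.

    Assumes tree maps each parent keyword to a list of its
    sub keywords (the real file layout is not shown in the thread).
    """
    allowed = set()
    for parent, children in tree.items():
        allowed.add(parent)
        allowed.update(children)
    return allowed


def filter_keywords(candidates, allowed):
    """Keep only candidates that appear in allowed.

    Matching ignores case and surrounding whitespace; the spelling
    from the allowed list is returned, in first-seen order, without
    duplicates.
    """
    canonical = {keyword.lower(): keyword for keyword in allowed}
    kept = []
    for candidate in candidates:
        match = canonical.get(candidate.strip().lower())
        if match is not None and match not in kept:
            kept.append(match)
    return kept


if __name__ == "__main__":
    # Hypothetical sample data, not taken from the thread.
    tree = {"Sports": ["Football", "Tennis"], "Politics": ["Elections"]}
    allowed = flatten_keywords(tree)
    reply = ["football", "Elections", "Celebrity gossip", " Sports "]
    print(filter_keywords(reply, allowed))
```

A filter like this does not make the assistant choose better keywords; it only guarantees that nothing outside the list gets through.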

I’ve done a fair amount of prompting with chat models but given that I’m newer to knowledge retrieval, I can’t tell whether it’s my overall setup (i.e. what file goes where) or whether it’s my prompt.

Please let me know if you need me to share an example article and my list of keywords.

Again, thanks so much for your help. It’s much appreciated.

jr.2509 · December 16, 2023, 10:31am

No problem at all. Just to clarify - when you say prompt, do you refer to the assistant instructions or the prompt used in the interaction with the assistant during a thread?

pgartenburg · December 16, 2023, 5:25pm

In everything I’ve tried so far, that prompt was in the assistant’s instructions.
