I am wondering how to handle something. We have built a website scraper that scrapes all webpages on a website into Markdown format. These pages are combined into a single .md file and uploaded to a vector store. This is the main source of information for our agent. The Assistant-level instruction says it may only answer questions related to information found in the knowledge base.
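The combine step can be sketched in a few lines of Python; the directory layout and function name below are assumptions for illustration, not our actual scraper code:

```python
import pathlib

def combine_pages(page_dir: str, out_file: str) -> int:
    """Concatenate every scraped .md page into one file, separated by
    horizontal rules, and return the number of pages combined."""
    pages = sorted(pathlib.Path(page_dir).glob("*.md"))
    with open(out_file, "w", encoding="utf-8") as out:
        for i, page in enumerate(pages):
            if i:
                out.write("\n\n---\n\n")  # separator between pages
            out.write(page.read_text(encoding="utf-8"))
    return len(pages)
```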
We have also built a dashboard where you can see which conversations have taken place and what the assistant has answered. We have created a table to store “revised answers”, where we store the user input and the desired response. All revised answers are also stored in Markdown format and uploaded to the file cluster.
Now here comes the problem. What content should go in:
The prompt/instruction
Files in the cluster
Perhaps fine-tuning for a custom model.
We currently add the Markdown info to the cluster files, but I don’t have the feeling that the assistant is using these “revised Q/As” very well in new conversations. Any tips for this?
I would split the scraped websites and the revised answers into separate vector store collections. Then I would implement a slightly more complex tool function that first searches the collection with the revised answers; if no answer is found there, it searches the other collection.
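The fallback logic can be sketched in plain Python; the two search callables stand in for whatever vector-store client you use, so their names and signatures are hypothetical:

```python
from typing import Callable, Optional

def search_with_fallback(
    query: str,
    search_revised: Callable[[str], Optional[str]],  # searches the revised-answers collection
    search_site: Callable[[str], Optional[str]],     # searches the scraped-website collection
) -> Optional[str]:
    """Prefer the curated revised answers; fall back to scraped site content."""
    hit = search_revised(query)
    if hit is not None:
        return hit
    return search_site(query)
```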
Or, better, I would use LangGraph to implement a “state machine” that handles the logic of when to search what, and where. I find LangGraph excellent if you are familiar with state machines, and you don’t have to use LangChain for the assistant/completion.
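To show the kind of routing a LangGraph graph would orchestrate, here is the same idea as an explicit state machine in plain Python (no LangGraph dependency; the search callables are hypothetical stand-ins):

```python
from enum import Enum, auto

class State(Enum):
    SEARCH_REVISED = auto()  # try the curated revised answers first
    SEARCH_SITE = auto()     # fall back to the scraped website content
    ANSWER = auto()          # terminal: an answer was found
    NO_ANSWER = auto()       # terminal: nothing matched

def run_state_machine(query, search_revised, search_site):
    """Walk the states until a terminal state is reached; return the result."""
    state, result = State.SEARCH_REVISED, None
    while state not in (State.ANSWER, State.NO_ANSWER):
        if state is State.SEARCH_REVISED:
            result = search_revised(query)
            state = State.ANSWER if result else State.SEARCH_SITE
        else:  # State.SEARCH_SITE
            result = search_site(query)
            state = State.ANSWER if result else State.NO_ANSWER
    return result
```

In LangGraph the two search states would become nodes and the if/else transitions would become conditional edges.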
Hi Mark0, thanks for your suggestion. I was hoping to get this done using only the OpenAI Assistants v2 API, and not have to build something completely new.
I fixed this by changing the prompt to always fact-check the response against the content in the files (the files in the vector DB). In addition, I instruct the Assistants API to use the tool constraint “file_search” as a required tool. This forces the Assistants API to always fetch data from the vector database. If you do not give this hint, the Assistants API might respond without searching the vector DB.
This is how it looks in my C# code (using the C# OpenAI beta SDK):

```csharp
var runOptions = new RunCreationOptions() { AdditionalMessages = { content } };
runOptions.ToolConstraint = new ToolConstraint(new FileSearchToolDefinition() { MaxResults = 20 });
```
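For readers on the Python SDK, the same constraint is expressed via `tool_choice` on run creation. Below only the request payload is built so the snippet runs offline; the assistant ID is a placeholder:

```python
# Run-creation parameters that force a file_search call on every run,
# mirroring the C# ToolConstraint above. The assistant ID is a placeholder.
run_params = {
    "assistant_id": "asst_XXX",               # placeholder, not a real ID
    "tool_choice": {"type": "file_search"},   # required tool: always search files
    "tools": [
        {
            "type": "file_search",
            "file_search": {"max_num_results": 20},  # cap retrieved chunks
        }
    ],
}
```

With the `openai` Python package, this dict would be passed to `client.beta.threads.runs.create(thread_id=..., **run_params)`.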
It works quite well now! And by using the GPT-4o-mini model, it is quite cheap as well.