Vector database QnA answering based on info from multiple replies

Hello everybody,

What I am working on

I am working on a question-answering system. Our data is in the format of Question-Answer pairs.
I am using a vector database to search for matching questions.

Initial and current state

Initially, I was embedding only the questions, but I soon realized that sometimes, because of how the user phrases the question, the vector similarity search would not match the “question” field in the vector database, even though the answer the user is looking for is present in the “answer” field. I then combined the question and answer when computing the embedding. This improved the results, but I still found a case where it does not work well:
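A minimal sketch of that combined-text approach (the field names and `Question:`/`Answer:` format here are assumptions for illustration, not from my actual schema): concatenate each pair into one string before sending it to the embedding model, so wording from both fields can match the user's query.

```python
def build_embedding_text(question: str, answer: str) -> str:
    """Combine a Q&A pair into a single string so the embedding
    captures wording from both fields (hypothetical format)."""
    return f"Question: {question}\nAnswer: {answer}"

# Each record's combined text is what gets embedded and stored
# in the vector database alongside the original pair.
records = [
    {"question": "How do I cancel an appointment?",
     "answer": "Call the front desk or use the online portal."},
]
texts_to_embed = [build_embedding_text(r["question"], r["answer"])
                  for r in records]
```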

Current problem

I am using only the last user query to search the vector database, but people usually expect the bot to use details specified earlier in the conversation. Of course, passing multiple “user” and “assistant” messages to ChatGPT is possible, but when searching the vector database, I don’t use the whole conversation.
One example: the user asks a question, gets an answer from the bot, but wants a clarification and says: “I actually have X type of appointment.” When querying the database, I would like to query with a phrase that contains both the question the user previously asked and the details they provided in the last reply, to get more relevant responses.

Question

Is there a method of improving the flow of the conversations, like using ChatGPT function calling, a scratchpad, etc.? With these types of systems, normal users expect them to behave like humans, and they ask questions as if they’re talking to a human.
Thank you


Hi and welcome to the Developer Forum!

You could pass the last few thousand tokens’ worth of interactions (system, user, and assistant) to the model and prompt it with this:

(prior system/user/assistant role messages)+
“Given the above context, what is the best vector retrieval search prompt to use to get the most relevant embedding chunks for this query ###{new_user_query}###”

This will allow the model to produce the best search term for the user’s latest question, and then you can use the retrievals from that search term as context for the question asked.


(prior system/user/assistant role messages)+
“Given the above context, what is the best vector retrieval search prompt to use to get the most relevant embedding chunks for this query ###{new_user_query}###”

What would be the role of the last message? User?

Yes, user most likely, since the previous array element would be assistant. This is also related to HyDE (Hypothetical Document Embeddings). It is a very powerful way of getting the AI to create phrases that correlate with the conversation, to aid retrieval.
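For reference, classic HyDE goes one step further: instead of rewriting the query, you ask the model to write a plausible answer and embed that hypothetical answer. A sketch of the shape of it, where `generate` is a placeholder for whatever chat-completion call you use (not a real API):

```python
def hyde_search_text(conversation, new_user_query, generate):
    """Ask the model for a plausible answer to the query, using the
    conversation for context, and return that hypothetical answer as
    the text to embed. `generate` is any callable mapping a message
    list to a string (placeholder for a real chat-completion call)."""
    messages = conversation + [{
        "role": "user",
        "content": (
            "Write a short, plausible answer to the following question, "
            "using details from the conversation above: "
            f"{new_user_query}"
        ),
    }]
    return generate(messages)  # embed this instead of the raw query
```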


If you’re doing this, my best advice is to play around with the prompt to find a way of generating something that has the largest correlation with your data, i.e., if you have a dataset full of scientific abstracts you’re searching through, then you need to generate something similar to that.
