Follow-up questions in a QnA environment

apavia · February 23, 2023, 9:16am

I’m working on a QnA system for the business I work at. We have a big knowledge store of long documents, that change every now and then. I’ve build a script to split the documents in smaller documents (up to around 1000 tokens) then, using the embeddings API, I create the embeddings and store them. Then the user asks a question, which is also embedded and then use the cosine similarity to find the appropriate top k text fragments and call the completion API. So far so good.
However, I was wondering, how would I create a system where the user can either ask follow-up questions, where the question won’t have enough context and consequently the embedding-search mechanism wouldn’t work on the question alone, or ask questions on completely different topics.

I’ve thought of maybe summarizing the previous context and adding it to the prompt but I fear that could ‘harm’ the search if the question is on a different topic.

My question is, what is the best way to approach this problem? Multiple API calls are not and issue, and fine-tuning wouldn’t be a problem either.
Thanks.

mitxelmo · November 17, 2023, 12:35pm

Hi @apavia I am working on something very similar and I was thinking about passing to the embeddings-search function a modified query. I think it is safe to assume the user will do mostly follow-up questions, since it is simply possible to create new “conversations” to change the topic radically. With this in mind, my first idea would be to append the previous n queries to the current n+1 query. I’ll give it a try shortly, let me know if you did something else and how well it went

mitxelmo · November 17, 2023, 1:00pm

I ended up appending also part of the assistant responses to the subsequent inputs, since sometimes the content of the follow up questions is actually given by the AI assistant responses. So far that makes the retrieved “chunks” to be more “accurate”, according to my quick tests.

Topic		Replies	Views
Context aware context for follow-up question Prompting embeddings , gpt-4 , api	13	8446	October 16, 2024
How can I deal with follow up question in the conversation? API	6	4821	December 24, 2023
Vector database QnA answering based on info from multiple replies Prompting chatgpt	4	2549	September 25, 2023
Maintain context for a chat bot based on embeddings? API	2	442	April 2, 2024
Fine-tuning problem API	4	2055	December 19, 2022

Follow-up questions in a QnA environment

Related topics