How to fix wrong answers in a Q&A system

Hello everyone.
I’m developing a Q&A system with a custom knowledge base.
I’m using Pinecone to store embeddings and retrieve data.
I have embedded two documents, for example the financial statement PDFs of Google and Apple.
When the question is “Which shareholders are common between Google and Apple?”, the system generates a wrong answer.
When I ask about the shareholders of Google alone, it gives the right answer.
In other words, it answers correctly for each company individually, but gives a wrong answer when the question combines them.
I’m using the gpt-3.5-turbo-16k model.
Please let me know the reason for this and how to fix it.
Thanks.

Hi,

You should ask the model to create the search term to use, e.g. “Given this question ###{user_query}### what would be the ideal vector retrieval search prompt?”. Then take the result, vectorise it, and search against that. Finally, give the model the entire sequence, i.e. the original question and the generated search prompt along with the context from the vector retrieval, and ask it to answer the original question, this time for an answer rather than a search prompt. In short (there is a code sketch after this list):

  1. Get User Input
  2. Ask LLM for best vector search to use for {user_input}
  3. Vectorise response
  4. Search for that vector and return the top 3-5 results
  5. Add all of that text to a new prompt and include a request to now solve the original {user_input} query given all of the above information.
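
A minimal Python sketch of those five steps, assuming the OpenAI Python SDK and an existing Pinecone index whose vectors carry their source text under `metadata["text"]`; the model names, that metadata field, and the prompt wording are illustrative assumptions, not a fixed recipe:

```python
from openai import OpenAI

client = OpenAI()

def answer(user_input: str, index) -> str:
    # Steps 1-2: ask the LLM to write the ideal retrieval query.
    search_prompt = client.chat.completions.create(
        model="gpt-3.5-turbo-16k",
        messages=[{
            "role": "user",
            "content": (
                f"Given this question ###{user_input}### "
                "what would be the ideal vector retrieval search prompt?"
            ),
        }],
    ).choices[0].message.content

    # Step 3: vectorise the generated search prompt.
    query_vec = client.embeddings.create(
        model="text-embedding-ada-002",
        input=search_prompt,
    ).data[0].embedding

    # Step 4: fetch the top 3-5 matches from the vector store.
    results = index.query(vector=query_vec, top_k=5, include_metadata=True)
    context = "\n\n".join(m.metadata["text"] for m in results.matches)

    # Step 5: answer the original question given everything gathered so far.
    final = client.chat.completions.create(
        model="gpt-3.5-turbo-16k",
        messages=[{
            "role": "user",
            "content": (
                f"Search prompt used: {search_prompt}\n\n"
                f"Retrieved context:\n{context}\n\n"
                f"Now answer the original question: {user_input}"
            ),
        }],
    )
    return final.choices[0].message.content
```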

Thanks for your quick reply.
I understand your suggestion.
But frankly speaking, the context currently retrieved is already sufficient to produce the exact answer.
So I think there is no need to use your method.

When you say to vectorise the response and search for that vector, are you talking about having a vector database like Pinecone that you’re using in conjunction with ChatGPT?

I’m currently just uploading JSON files to the assistant.

Yup, the Assistants API abstracts away a bunch of the RAG pipeline, but for many of the use cases I deal with I need more granular control over that pipeline, so I implement those steps myself. Systems like Pinecone have their own Assistants-like API, but again, it removes fine control from the flow, so I opt out of it.

You can also make use of vector DBs like ChromaDB, which you can spin up locally with a fairly lightweight footprint. For example, a minimal local setup might look like the sketch below (the collection name and documents are made up for illustration):
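
```python
import chromadb

# In-memory client; use chromadb.PersistentClient(path="./db") to keep data on disk.
client = chromadb.Client()
collection = client.create_collection("financial_statements")

# Chroma chunks nothing by itself; it embeds each document string as given,
# using its default embedding function.
collection.add(
    documents=["Google 10-K excerpt ...", "Apple 10-K excerpt ..."],
    ids=["goog-1", "aapl-1"],
)

# Query by text; returns the closest documents plus distances.
results = collection.query(query_texts=["Who are Google's shareholders?"], n_results=2)
print(results["documents"])
```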

If you wish to take a bunch of the legwork out, you can use frameworks like LlamaIndex to handle most of the I/O and functionality while still keeping control over most aspects. A rough sketch of that route follows (import paths vary by version; this follows the `llama_index.core` layout, and the "data" directory is a placeholder):
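
```python
from llama_index.core import SimpleDirectoryReader, VectorStoreIndex

# Load every file in ./data, then chunk, embed, and build an in-memory index.
documents = SimpleDirectoryReader("data").load_data()
index = VectorStoreIndex.from_documents(documents)

# The query engine handles retrieval and prompting end to end; you can still
# swap in your own retriever, vector store, or LLM where you need finer control.
query_engine = index.as_query_engine()
response = query_engine.query("Which shareholders are common between Google and Apple?")
print(response)
```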

All depends on your use case and need for granularity.