Hello everyone,
I’m encountering an issue with my assistant when querying specific legal articles from a large document stored in a vector store. The document is quite extensive: 2,600 articles and around 500 pages (a .txt file of about 2 MB), split into chunks of 800 tokens with a 400-token overlap.
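For concreteness, here is a simplified sketch of the upload step using the OpenAI Python SDK. The filename and store name are placeholders, and in older SDK versions the vector store endpoints live under `client.beta.vector_stores` instead:

```python
from openai import OpenAI

client = OpenAI()

# Create the vector store (client.beta.vector_stores in older SDK versions)
vector_store = client.vector_stores.create(name="legal-articles")

# Upload the source document
with open("legal_document.txt", "rb") as f:  # placeholder filename
    uploaded = client.files.create(file=f, purpose="assistants")

# Attach it with an explicit static chunking strategy (800 / 400 as above;
# note the overlap may not exceed half of max_chunk_size_tokens)
client.vector_stores.files.create(
    vector_store_id=vector_store.id,
    file_id=uploaded.id,
    chunking_strategy={
        "type": "static",
        "static": {
            "max_chunk_size_tokens": 800,
            "chunk_overlap_tokens": 400,
        },
    },
)
```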
The problem arises when I ask the assistant about a specific article (usually 400-700 characters long): instead of answering from the exact article, it either returns a completely incorrect response or pulls in details from a different article.
Here’s a summary of my setup:
- I’ve tried using different models (GPT-3.5, GPT-4, etc.).
- I’ve experimented with various temperature and Top P settings (currently temperature 0.45 / Top P 0.78).
- The document is stored in a vector store, and the assistant’s instructions are written to force precise answers based on the vector store content (a simplified sketch of this configuration follows this list).
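Roughly, the assistant and run are set up like this (simplified; the instructions text and the example question are placeholders, and `vector_store` comes from the upload sketch above):

```python
# Assistant restricted to the vector store via the file_search tool
assistant = client.beta.assistants.create(
    model="gpt-4",
    instructions=(
        "Answer only from the attached legal document. "
        "Quote the requested article verbatim and cite its number."
    ),
    tools=[{"type": "file_search"}],
    tool_resources={"file_search": {"vector_store_ids": [vector_store.id]}},
)

thread = client.beta.threads.create(
    messages=[{"role": "user", "content": "What does Article 1234 say?"}]
)

run = client.beta.threads.runs.create_and_poll(
    thread_id=thread.id,
    assistant_id=assistant.id,
    temperature=0.45,  # current settings; lower values are more deterministic
    top_p=0.78,
)
```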
Despite these attempts, I’m still getting incorrect results.
I’m wondering if there’s something I’m missing in my configuration or if there’s an issue with how the model handles this type of data. Could anyone help me optimize this setup or suggest further steps to improve the accuracy of responses?
Any guidance or a list of best practices for this type of implementation would be greatly appreciated!
Thanks in advance!