Hi there,
To get a better understanding of how the knowledge retrieval process works behind the scenes, I conducted a series of experiments, each requiring a different approach to the retrieval process. Initially I just aimed to figure out whether the file size and the location of the information within the file have an impact on runtime and accuracy. But the focus shifted towards a more general question: how to best utilize the knowledge retrieval tool. Here are my key findings:
For trivial search tasks, the file size and the location of the information within the file do not matter.
For complex search tasks, they do.
Some level of “abstraction” is performed if necessary, e.g. searching for synonyms or adjusting the threshold of the vector search.
Formatting matters!
There was an undocumented limit of 2M tokens for knowledge files (thanks @logankilpatrick for documenting it!).
Everything is quite inconsistent and beta, and sometimes the assistant just goes rogue: you might end up with $0.50 API calls and still no decent answer.
A lot feels like a black box, and it would be really helpful to at least see what exactly the knowledge retrieval process is adding to the context! @OpenAI: Please populate the `retrieval` field.
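Since the 2M-token limit is easy to hit with large knowledge files, a rough pre-upload check can save a failed upload. Here's a minimal sketch using the common ~4-characters-per-token heuristic; the function names and the heuristic are my own, and an actual tokenizer (e.g. tiktoken) would give a precise count:

```python
# Rough pre-upload sanity check against the 2M-token knowledge-file limit.
# This is an estimate only: real token counts depend on the tokenizer.

TOKEN_LIMIT = 2_000_000
CHARS_PER_TOKEN = 4  # rough heuristic for English text

def estimated_tokens(text: str) -> int:
    """Estimate the token count of a knowledge file's contents."""
    return len(text) // CHARS_PER_TOKEN

def fits_limit(text: str) -> bool:
    """Return True if the file is likely under the 2M-token limit."""
    return estimated_tokens(text) <= TOKEN_LIMIT

# Example: a ~10,000-character file is nowhere near the limit.
sample = "knowledge " * 1000
print(estimated_tokens(sample), fits_limit(sample))  # 2500 True
```

If the estimate lands anywhere near the limit, it's worth counting with the model's actual tokenizer before uploading.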
I wrote a detailed article about the experiments and my findings; you can find it over here.
I am curious to hear your thoughts on this and the experiences you have had with the knowledge retrieval process!