I am using File Seach tool and have stored json files in OpenAI Vector DB. I saw that default chunks that could be retrieved are 20 and could be fewer. I saw a note giving reasons why retrieved chunks could be lesser than the number of chunks set by the me. Can someone understand what does token budget means in this scenario. Does it mean even though gpt-3.5turbo-0125 has context size of 16k tokens only could store 4000 token info from vector db + system prompt?
Here is the information :
" Note that the file_search
tool may output fewer than this number for a myriad of reasons:
- The total number of chunks is fewer than
max_num_results
. - The total token size of all the retrieved chunks exceeds the token “budget” assigned to the
file_search
tool. Thefile_search
tool currently has a token budget of:- 4,000 tokens for
gpt-3.5-turbo
- 16,000 tokens for
gpt-4*
models"
- 4,000 tokens for