Token Budget in File Search tool

I am using the File Search tool and have stored JSON files in an OpenAI vector store. I saw that the default number of chunks that can be retrieved is 20, but that fewer may be returned. I also saw a note giving reasons why the retrieved chunks could be fewer than the number I set. Can someone explain what the token budget means in this scenario? Does it mean that even though gpt-3.5-turbo-0125 has a 16k-token context window, only 4,000 tokens of information from the vector store can be included alongside the system prompt?

Link for Reference

Here is the information:
" Note that the file_search tool may output fewer than this number for a myriad of reasons:

  • The total number of chunks is fewer than max_num_results.
  • The total token size of all the retrieved chunks exceeds the token “budget” assigned to the file_search tool. The file_search tool currently has a token budget of:
    • 4,000 tokens for gpt-3.5-turbo
    • 16,000 tokens for gpt-4* models"

Your interpretation is correct.


If that is really true, I don’t understand the reasoning behind allowing a chunk size of up to 4096 tokens and a chunk overlap of up to 2048 tokens. For example, if I set my chunk size to 4096, OpenAI will always retrieve 1 or 0 chunks for gpt-3.5-turbo, because a single chunk already exceeds the budget they set. :sweat_smile:
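
One way to work with the budget is to configure smaller chunks so that several of them still fit inside 4,000 tokens. This is only a sketch, assuming the static `chunking_strategy` parameters from the vector store API reference; the file name is a placeholder, and depending on SDK version the vector store methods may live under `client.vector_stores` instead of `client.beta.vector_stores`:

```python
from openai import OpenAI

client = OpenAI()

vector_store = client.beta.vector_stores.create(name="json-docs")

# Upload the file, then attach it with a static chunking strategy.
# Smaller chunks mean more of them fit within the 4,000-token budget of
# gpt-3.5-turbo (e.g. ~10 chunks of 400 tokens instead of 0-1 chunks of 4,096).
uploaded = client.files.create(file=open("data.json", "rb"), purpose="assistants")

client.beta.vector_stores.files.create(
    vector_store_id=vector_store.id,
    file_id=uploaded.id,
    chunking_strategy={
        "type": "static",
        "static": {
            "max_chunk_size_tokens": 400,  # default is 800; maximum allowed is 4096
            "chunk_overlap_tokens": 100,   # must not exceed half of max_chunk_size_tokens
        },
    },
)
```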