Even after specifying `"file_search": {"max_num_results": 2}` in an assistant createRun, the response sometimes fetches more than 2 annotations from the vector store. Is this a bug, or is there another way to do this?
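For reference, a minimal sketch of where the cap is supposed to be passed in the createRun request body (the assistant ID is a placeholder; this only builds the payload, it makes no API call):

```python
# Sketch of a createRun request body with the file-search result cap.
# "asst_placeholder" is a dummy ID for illustration.
payload = {
    "assistant_id": "asst_placeholder",
    "tools": [
        {
            "type": "file_search",
            "file_search": {"max_num_results": 2},  # cap requested here
        }
    ],
}

requested_cap = payload["tools"][0]["file_search"]["max_num_results"]
print(requested_cap)  # 2 -- yet more than 2 annotations can come back
```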
I’m using Azure OpenAI (API version 2024-07-01-preview) and have encountered the same problem. Even when I set this parameter to 3, the file-search prompt tokens for the GPT-4 assistant are still 16k, which equals 800 tokens × 20 chunks.
I think there is a bug on OpenAI’s side causing this parameter to malfunction.
Definitely a bug; a few experiments led me to believe the max_num_results cap (set on the assistant or run object) is doubled. This workaround may help:
import math

max_results = assistant.tools[0].file_search.max_num_results
max_results = math.ceil(max_results / 2)
user_prompt = f"{user_prompt}\n - retrieve a maximum of {max_results} items"
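Wrapped as a self-contained helper, the workaround above looks like this (the function name `apply_workaround` is mine, not part of any API):

```python
import math

def apply_workaround(user_prompt: str, max_num_results: int) -> str:
    """Halve the configured cap (since it appears to be doubled in practice)
    and restate the limit inside the prompt itself."""
    max_results = math.ceil(max_num_results / 2)
    return f"{user_prompt}\n - retrieve a maximum of {max_results} items"

print(apply_workaround("Summarize the report.", 5))
# Appends " - retrieve a maximum of 3 items" to the prompt
```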
The issue persists to this day. In the Responses API, tool calls to the File Search tool (against a vector store containing a PDF book) number at least 20. The response output is capped at 20 items, and all of them can be filled with tool calls.
This happens only with 4o-mini, though; 4.1-mini and 4.1-nano make a single call to the same vector store.
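In the Responses API the cap sits directly on the file_search tool entry, next to the vector store IDs. A payload sketch for reproducing the comparison across models (the vector store ID is a placeholder; no request is actually sent):

```python
# Responses API request body sketch; "vs_placeholder" is a dummy ID.
payload = {
    "model": "gpt-4o-mini",  # the model observed to over-call file search
    "input": "What does chapter 3 cover?",
    "tools": [
        {
            "type": "file_search",
            "vector_store_ids": ["vs_placeholder"],
            "max_num_results": 5,  # reportedly exceeded on 4o-mini
        }
    ],
}

print(payload["tools"][0]["max_num_results"])  # 5
```

Swapping `"model"` for `"gpt-4.1-mini"` or `"gpt-4.1-nano"` with the same vector store is how the single-call behavior above was observed.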