Why is OpenAI Assistants retrieval so "file" oriented? How do you work around this?

From the docs:

import fs from "fs";
import OpenAI from "openai";

const openai = new OpenAI();

// Upload a file with an "assistants" purpose
const file = await openai.files.create({
  file: fs.createReadStream("knowledge.pdf"),
  purpose: "assistants",
});

// Add the file to the assistant
const assistant = await openai.beta.assistants.create({
  instructions: "You are a customer support chatbot. Use your knowledge base to best respond to customer queries.",
  model: "gpt-4-turbo-preview",
  tools: [{"type": "retrieval"}],
  file_ids: [file.id]
});

It’s not very intuitive to me. For example, I’m used to working with Supabase or a similar db, which is usually a list of objects, not a “file”.

Of course we can work around this, but I was wondering if something like this could have made sense:

const documents = [
  { data: 'This is a document' },
  { data: 'This is another document' },
  { data: 'This is a third document' },
  { data: 'This is a fourth document' },
  { data: 'This is a fifth document' },
  { data: 'This is a sixth document' },
  { data: 'This is a seventh document' },
  { data: 'This is an eighth document' },
  { data: 'This is a ninth document' },
  { data: 'This is a tenth document', metadata: { path: 'https://google.com/abcd' }}
]
const files = await Promise.all(documents.map((document) => openai.files.create({ data: document, purpose: "assistants" })));

// later, create the assistant with the mapped file ids
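For what it’s worth, you can get reasonably close to this today by serializing each object into its own small file before uploading. A rough sketch, assuming the Node SDK’s toFile helper and the same file_ids wiring as the docs snippet above (knowledgeBase and the file names are just illustrative):

import OpenAI, { toFile } from "openai";

const openai = new OpenAI();

// Illustrative stand-in for your list of objects / database rows
const knowledgeBase = [
  { data: "This is a document" },
  { data: "This is a tenth document", metadata: { path: "https://google.com/abcd" } },
];

// Each object becomes its own uploaded file with an "assistants" purpose
const files = await Promise.all(
  knowledgeBase.map(async (doc, i) =>
    openai.files.create({
      file: await toFile(Buffer.from(JSON.stringify(doc)), `doc-${i}.txt`),
      purpose: "assistants",
    })
  )
);

// Attach the uploaded file ids when creating the assistant
const assistant = await openai.beta.assistants.create({
  instructions: "Use the attached knowledge files to answer customer queries.",
  model: "gpt-4-turbo-preview",
  tools: [{ type: "retrieval" }],
  file_ids: files.map((f) => f.id),
});

Keep in mind the retrieval tool caps how many files one assistant can have attached, so very granular per-row files only go so far.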

How do you guys do retrieval over non-files using the Assistants API?

You have the option of implementing your own search locally, or at least via your own server.

Implement a function locally (on your server) and tell the assistant about it.
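As a rough illustration of that route (plain function calling, not the built-in retrieval), a sketch where search_documents is a hypothetical function you implement on your own server:

// Register a function tool; the assistant can then ask your server to search.
// "search_documents" and its parameters are hypothetical; you run the actual
// lookup (e.g. a Supabase query) yourself when the run requires action.
const assistant = await openai.beta.assistants.create({
  model: "gpt-4-turbo-preview",
  instructions: "Call search_documents when you need knowledge-base facts.",
  tools: [
    {
      type: "function",
      function: {
        name: "search_documents",
        description: "Search the knowledge base and return matching snippets",
        parameters: {
          type: "object",
          properties: {
            query: { type: "string", description: "What to look for" },
          },
          required: ["query"],
        },
      },
    },
  ],
});

// When a run reaches status "requires_action", execute the search yourself and
// return the results via submit_tool_outputs.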

I should have added:

WITHOUT function calling


Assistants’ built-in retrieval is based on attaching uploaded files to an assistant. It both inserts some of the document contents into context (always) and also gives the model an internal search function.

What you actually can “add” to inject your vector database knowledge:

additional_instructions, as a run parameter, which might take up to 32k characters like some other inputs,

containing a preface such as: “Here’s automatic knowledge retrieval for the AI to use, based on the user’s latest input:”
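
A minimal sketch of that injection, assuming you already have a thread and an assistant, and that retrieveRelevantChunks is your own vector-database lookup (the name is illustrative):

// Your own semantic search (e.g. Supabase/pgvector); returns an array of strings
const chunks = await retrieveRelevantChunks(latestUserMessage);

// Inject the results for this run only via the additional_instructions parameter
const run = await openai.beta.threads.runs.create(thread.id, {
  assistant_id: assistant.id,
  additional_instructions:
    "Here's automatic knowledge retrieval for the AI to use, based on the user's latest input:\n\n" +
    chunks.join("\n---\n"),
});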


… wow, that’s restrictive!

That’s a prompt, not retrieval; it doesn’t scale.

Okay, my guess is that one should read the database rows of each table and just “throw” them at the assistant as a txt file.

Pretty dirty, but it works.

Also, you need to re-sync manually every time your data updates.
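
That re-sync can at least be scripted. A rough sketch, reusing toFile from the earlier snippet and assuming fetchAllRows is your own database query and previousFileId is the id of the last upload you kept track of (all illustrative names):

// Dump the current table contents into one text blob
const rows = await fetchAllRows();
const dump = rows.map((r) => JSON.stringify(r)).join("\n");

// Remove the stale knowledge file, if any, then upload the fresh dump
if (previousFileId) await openai.files.del(previousFileId);
const file = await openai.files.create({
  file: await toFile(Buffer.from(dump), "knowledge.txt"),
  purpose: "assistants",
});

// Point the assistant at the new file
await openai.beta.assistants.update(assistant.id, { file_ids: [file.id] });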

No, that’s how you would inject automated semantic search results based on the user’s input, retrieved from a top-n, threshold-cutoff, embeddings-based vector database.

It scales because you are doing embeddings math on demand, not slow AI inference at the AI’s whim with its ability to iterate.

No need to waste full-context tokens letting the AI function-call to search for the same thing.

The additional “prompt” (a prompt is actually the ending of the context that denotes it is the AI’s turn to write as its own entity) lets the AI know why the text is there and that it applies to this chat turn.
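
Concretely, that on-demand embeddings math can be as simple as the sketch below, assuming a list of chunks you embedded ahead of time (an in-memory array here; a real setup would use pgvector or similar) and text-embedding-3-small as the embedding model; all names are illustrative:

// chunks: [{ text, embedding }] that you pre-computed and stored yourself
const cosine = (a, b) => {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
};

// Embed only the user's latest input, then rank the pre-embedded chunks
async function topChunks(query, chunks, n = 5, threshold = 0.3) {
  const res = await openai.embeddings.create({
    model: "text-embedding-3-small",
    input: query,
  });
  const q = res.data[0].embedding;
  return chunks
    .map((c) => ({ ...c, score: cosine(q, c.embedding) }))
    .filter((c) => c.score >= threshold) // threshold cutoff
    .sort((a, b) => b.score - a.score)
    .slice(0, n) // top-n
    .map((c) => c.text);
}

// The joined results are what get passed as additional_instructions above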

I think you’re completely missing the point of the Assistants API.

If you are doing semantic search on your own, why are you even using Assistants? It’s supposed to abstract this away.
Otherwise, sure, I can just use raw LLM calls and do semantic search, function calling, code interpreter, etc. on my own infra.


You can just rewrite that to an all-purpose madlib:
If you are doing ____ why are you even using Assistants??
