The documentation is too lacking

poidomani47 · August 23, 2024, 12:41pm

Someone should seriously take care of the documentation, which is far too sparse.
Example:
file_ids (array, optional)
A list of file IDs to add to the vector store. There can be a maximum of 10000 files in a vector store.
Question: if I don’t attach an array of file_ids, are all the ones in the store used or are they all ignored?

I don’t see this information written anywhere.
And this is just a simple example; there are more important cases, such as streaming, a very complex topic with documentation that fits on a postage stamp.

qingliaowu · August 23, 2024, 1:13pm

Yes, this is undocumented.

_j · August 23, 2024, 7:34pm

While the reply above spotted the use of “10000”, and a link about 10000 that I wrote was provided in that reply, it is not answering the question.

An assistants’ vector store can be created without any documents, just to obtain its ID.

Then you add the document file IDs (for files you’ve already uploaded, which is how you obtain a file ID).

“Array” being permitted in some calls means you can list many files all at once to be added, instead of making many API calls to add single IDs.

The vector store maintains all the files added, and you can add more, along with deleting by ID. So, once the vector store has the documents you want as assistant behavior and it is connected, you don’t have to continue referring to files.
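To make that lifecycle concrete, here is a minimal sketch. It is a toy in-memory model, not the OpenAI SDK: `ToyVectorStore`, its methods, and the file IDs are all hypothetical names chosen for illustration. It only mirrors the behavior described above: a store can start empty, files get in only when you add them, and several IDs can be added in one call.

```python
# Toy in-memory model of the workflow described above.
# NOT the OpenAI SDK: ToyVectorStore and its methods are hypothetical.

class ToyVectorStore:
    def __init__(self, file_ids=None):
        # file_ids is optional. Omitting it gives an empty store;
        # it does not pull in every file that was ever uploaded.
        self.file_ids = set(file_ids or [])

    def add_files(self, file_ids):
        """Add several file IDs in one call (the "array" form)."""
        self.file_ids.update(file_ids)

    def delete_file(self, file_id):
        """Remove a single file from the store by its ID."""
        self.file_ids.discard(file_id)


# Files already uploaded to the account (made-up IDs).
uploaded = {"file-a", "file-b", "file-c"}

store = ToyVectorStore()             # created without file_ids
empty_count = len(store.file_ids)    # nothing is attached yet

store.add_files(["file-a", "file-b"])
store.delete_file("file-a")

in_store = sorted(store.file_ids)                  # what a search would cover
not_in_store = sorted(uploaded - store.file_ids)   # uploaded, but not searchable
```

In this model, `in_store` ends up holding only `file-b`, while `file-a` and `file-c` remain uploaded but outside the store.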

Searches against the vector database cover all the documents in it. A vector store attached to an assistant and a separate vector store attached to a conversation thread are combined into one search. Files that define assistant behavior and files that you might allow a user to upload are commingled in the same single search function, and the AI cannot tell who uploaded a file. That is problematic, and it is made worse by internal instructions that say “the user uploaded these files” even when they are an assistant’s skill.
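The loss of provenance can be sketched in a few lines. This is again a toy model and not how the real file search is implemented: `combined_search` is a hypothetical function, and plain word overlap stands in for embedding similarity. The point is only that once both stores are pooled, the returned chunks carry no trace of which store they came from.

```python
# Toy model of one search running over two stores at once.
# NOT the real file search: combined_search is hypothetical, and
# word overlap is a stand-in for embedding similarity.

def combined_search(assistant_chunks, thread_chunks, query, top_k=2):
    # Both stores are pooled; the origin of each chunk is dropped here.
    pool = list(assistant_chunks) + list(thread_chunks)
    query_words = set(query.lower().split())

    def score(chunk):
        # Count how many query words appear in the chunk.
        return len(query_words & set(chunk.lower().split()))

    ranked = sorted(pool, key=score, reverse=True)
    return [chunk for chunk in ranked if score(chunk) > 0][:top_k]


assistant_chunks = [
    "reset the router by holding the power button",
    "warranty lasts two years",
]
thread_chunks = ["my router shows a red light after reset"]

results = combined_search(assistant_chunks, thread_chunks, "router reset")
```

Here `results` contains one chunk from the assistant's store and one from the thread's store, as plain strings with nothing to tell them apart.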

If you are looking at the API reference, which is where the pasted parameter comes from, perhaps you instead want to click Documentation, which has more tutorial-like explanations. Then you can evaluate how this actually works, and see whether returning searched chunks of documents based on phrase similarity (and not the source) and adding them as tokens to a chat (at expense) is fit for any task.

poidomani47 · August 24, 2024, 12:49pm

What is not clear is this: if I add a store and do not specify a list of file_ids, are all the files consulted, or none?
Thanks

Ing. G.Poidomani

_j · August 27, 2024, 1:44am

No uploaded files are processed with document extraction or placed into a vector database if they aren’t specifically added to a vector store.

If I am an API developer and have one client with proprietary price lists and a troubleshooting database, I certainly wouldn’t want anything automatically added to another client’s assistant file search simply because I didn’t specify file IDs.
