What model does OpenAI use for the embeddings? And when do the documents get chunked: after they are referenced and added to the vector store? Thanks for the info, folks…
Hello,
By default, if you're using
embeddings = OpenAIEmbeddings()
it will use text-embedding-ada-002. Otherwise, you can choose your model by passing it explicitly:
embeddings = OpenAIEmbeddings(model="text-embedding-3-small")
Hope this helped
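If it helps, here's a minimal runnable sketch, assuming the langchain-openai package (in older LangChain versions the import path differs):

from langchain_openai import OpenAIEmbeddings

# Defaults to text-embedding-ada-002 if no model is passed
embeddings = OpenAIEmbeddings(model="text-embedding-3-small")

# Embed a single query; returns the embedding as a list of floats
vector = embeddings.embed_query("hello world")
print(len(vector))  # 1536 dimensions for text-embedding-3-small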
I mean when it's being chunked and processed from your file; I don't mean the Embeddings API.
Unfortunately, we know almost nothing when it comes to File Search… At the moment its inner workings are kind of a black box: the embedding model, the search query formulation, the search results, and the search parameters. File Search itself performs really quite well, even with (in our case) over 3,000 different files, and manages to provide a relevant answer maybe 90% of the time. However, we need more information on how it actually works, and we need to be able to tweak the parameters. So I unfortunately cannot answer your question.
Can this be configured/modified through the Assistants/Files UI on the platform?
You can configure the chunking of files when uploading them to vector stores for embedding/vectorization, and you can customize the maximum number of results File Search may return, in the UI as well as via the API. However, that is all we can do to "customize" File Search at the moment. You can try to steer it in a general direction through prompting, telling the assistant how to formulate the msearch query (what it uses to query the semantic and keyword search tool) and what to do with the results it receives, but that is neither reliable nor ideal, since there is no good way to inspect the search procedure. A rough sketch of the two knobs you do have is below.
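Here is a minimal sketch of both settings via the API, assuming the official openai Python SDK (on older SDK versions the vector store endpoints live under client.beta.vector_stores instead). The IDs are placeholders, and the numbers shown (800/400 tokens, 20 results) are just the documented defaults, not recommendations:

from openai import OpenAI

client = OpenAI()

# 1) Custom chunking when adding a file to a vector store.
#    "static" chunking lets you set chunk size and overlap in tokens;
#    800/400 are the service defaults.
client.vector_stores.files.create(
    vector_store_id="vs_...",   # your vector store ID
    file_id="file-...",         # an already-uploaded file ID
    chunking_strategy={
        "type": "static",
        "static": {
            "max_chunk_size_tokens": 800,
            "chunk_overlap_tokens": 400,
        },
    },
)

# 2) Cap how many chunks File Search may return to the assistant.
assistant = client.beta.assistants.create(
    model="gpt-4o",
    tools=[{"type": "file_search", "file_search": {"max_num_results": 20}}],
    tool_resources={"file_search": {"vector_store_ids": ["vs_..."]}},
)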