Text-embedding-3-large at 256 or 3072 dimensions

    openai.embeddings.create(input=text, model="text-embedding-3-large").data[0].embedding

This returns a vector of length 3072 if the dimensions parameter is not set.
OpenAI's file search uses text-embedding-3-large at 256 dimensions by default.
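
For context, both sizes come from the same model: the text-embedding-3 models accept a dimensions parameter that returns a shortened vector. A quick sketch (assumes the openai Python SDK v1.x and OPENAI_API_KEY set in the environment):

    from openai import OpenAI

    client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment
    text = "hello world"

    # Default: full-size 3072-dimensional vector
    full = client.embeddings.create(
        input=text,
        model="text-embedding-3-large",
    ).data[0].embedding
    print(len(full))   # 3072

    # Same model, shortened server-side via the dimensions parameter
    short = client.embeddings.create(
        input=text,
        model="text-embedding-3-large",
        dimensions=256,
    ).data[0].embedding
    print(len(short))  # 256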

Why? Which is better, 256 or 3072, and how do you choose?

I asked ChatGPT about it, but the answer doesn't help much:

  1. Larger Vectors (e.g., 3072 dimensions):

    • Pros: Can capture more intricate details and nuances about the input text. This is generally beneficial if you need higher accuracy for tasks such as semantic search or nuanced recommendations.
    • Cons: More computationally expensive to process and store. It may also lead to diminishing returns in certain applications where additional detail does not provide significant extra value.
  2. Smaller Vectors (e.g., 256 dimensions):

    • Pros: More efficient in terms of storage and computation. Often suitable for applications where approximate similarity is sufficient.
    • Cons: Might miss out on some of the depth and nuances captured in larger embeddings.

When OpenAI or any other provider picks a specific default embedding dimension, like 256, for a product feature (e.g., file search), it is generally because that size strikes a good balance between performance, speed, and storage efficiency for the typical use cases of that service.

What’s best for you depends on your specific use case. If computational resources and speed are a priority, and if you find that 256 dimensions suffice for your accuracy needs, the smaller embeddings are preferable. However, if accuracy is paramount and you’re dealing with complex language tasks, you may opt for a larger embedding size, if available.
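
One way to answer "how to choose" empirically: the text-embedding-3 models were trained so that embeddings can be shortened, and the docs describe manual shortening as truncating the vector and re-normalizing it (roughly what the dimensions parameter does for you). OpenAI's announcement post also reported that text-embedding-3-large shortened to 256 dimensions still scores higher on the MTEB benchmark than text-embedding-ada-002 at 1536, so 256 is less lossy than the number suggests. You can therefore embed a sample of your own data once at 3072 and compare both sizes locally before committing. The sketch below does that; the queries and docs are made-up placeholders, so swap in a sample of your real corpus (ideally with labelled relevant pairs). Storage-wise, at float32 a 3072-dimensional vector is about 12 KB versus about 1 KB at 256.

    import numpy as np
    from openai import OpenAI

    client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

    def embed(texts, model="text-embedding-3-large"):
        resp = client.embeddings.create(input=texts, model=model)
        return np.array([d.embedding for d in resp.data])   # shape (n, 3072)

    def shorten(vectors, dims):
        # Manual shortening: truncate to the first `dims` values, then
        # re-normalize to unit length.
        cut = vectors[:, :dims]
        return cut / np.linalg.norm(cut, axis=1, keepdims=True)

    # Placeholder data -- replace with queries and documents from your corpus.
    queries = ["reset my password", "refund for a damaged item"]
    docs = [
        "How to change or reset your account password",
        "Returns and refunds for defective or damaged products",
        "Shipping times for international orders",
    ]

    q_full, d_full = embed(queries), embed(docs)

    for dims in (256, 3072):
        q, d = shorten(q_full, dims), shorten(d_full, dims)
        sims = q @ d.T                 # cosine similarity (unit vectors)
        print(dims, "top doc per query:", sims.argmax(axis=1))

If the top-ranked documents (and your relevance labels, if you have them) barely change at 256, the roughly 12x smaller index is probably the right trade-off; if they do change noticeably, pay for 3072 or try an intermediate value such as 1024.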