I’m experimenting with embeddings and storing them in a vector DB. My thought is that when I ask a model for the embedding for a given input, that embedding is associated with that model and cannot be reliably used with a different model. Is that true? If so, then if I store embeddings generated with model A, do I need to update or regenerate them if I switch to model A’ or B? BTW, I asked ChatGPT this question and it said I would need to regenerate my embeddings, but thought I would double-check with actual humans! Thanks.
With the current state of the art, embeddings created by one model cannot be mixed with embeddings from another. It is an active area of research, so this may change, but right now… no.
Embeddings are by nature tied to the encoder that created them. Different models use different architectures, training data, and tokenization, so each one maps text into its own vector space. A vector from model A has no meaningful interpretation in model B's space — comparing them just produces garbage.
Consider that at the most basic level, embeddings from different models don't even have the same dimensionality:
Ada (1024 dimensions),
Babbage (2048 dimensions),
Curie (4096 dimensions),
Davinci (12288 dimensions),
text-embedding-ada-002 (1536 dimensions).
So if the question is whether you can mix and match embeddings from different models and still get a meaningful similarity calculation, the answer is no. If you switch to model A' or B, regenerate everything in your vector DB with the new model.
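To make the point concrete, here is a small sketch using random vectors as stand-ins for real embeddings (the dimensions match the list above, but nothing here calls an actual embedding API). Mixing dimensions breaks the similarity math outright; and even when dimensions happen to match, the coordinate systems differ between models, so the score would be meaningless anyway.

```python
import math
import random

# Toy stand-ins for embeddings from two different models (random vectors,
# NOT real API output); dimensions taken from the list above.
emb_model_a = [random.random() for _ in range(1536)]   # e.g. an ada-002-sized vector
emb_model_b = [random.random() for _ in range(4096)]   # e.g. a curie-sized vector

def cosine_similarity(u, v):
    """Standard cosine similarity; only defined for equal-length vectors."""
    if len(u) != len(v):
        raise ValueError(f"dimension mismatch: {len(u)} vs {len(v)}")
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    return dot / (norm_u * norm_v)

# Same model (same dimension): a valid score you can rank by.
print(cosine_similarity(emb_model_a, [random.random() for _ in range(1536)]))

# Different models: the calculation fails on the dimension mismatch.
try:
    cosine_similarity(emb_model_a, emb_model_b)
except ValueError as e:
    print(e)  # dimension mismatch: 1536 vs 4096
```

Even truncating or padding the vectors to force the shapes to agree wouldn't help — the axes of one model's space don't correspond to the axes of another's.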