Semantic vs search embedding

Mayank11 · September 27, 2023, 7:14am

Openai makes distinction between similarity and search embeddings saying that similarity embeddings are more suited to assess if 2 texts are similar while search embeddings are more suited to identify if a short text is closely related to a much longer text.

Which models from openai embeddings specialize in which function? For example, for which use case should text-embedding-ada-002 model be used for?

Innovatix · September 27, 2023, 7:57am

Semantic embeddings are better for measuring the similarity between two texts, while search embeddings are better for finding long texts that are relevant to a short query.

Example:

Semantic embedding: What is the similarity between the sentences: ‘The cat sat on the mat’ and ‘The feline sat on the rug’?
Search embedding: Find all documents in the database that are relevant to the query: ‘What is the capital of France?’

Mayank11 · September 27, 2023, 10:06am

What does text-embedding-ada-002 do well - Semantic or Search ?

Innovatix · September 28, 2023, 8:39am

It can handle both semantic and search tasks well

Topic		Replies	Views
Models: Embedding vs Similarity vs Search Models API api	4	3252	July 9, 2023
Ada embedding vs SBERT API	1	4345	December 24, 2023
Search vs Similarity API	2	1802	August 19, 2022
Embedding with "text-search-davinci-query-001" API embeddings , chatgpt	3	1248	December 24, 2023
GPT3 vs SBERT for semantic search/similarity? API	1	1934	January 24, 2023

Semantic vs search embedding

Related topics