What is the method used behind cosine similarity in the evaluation?

Hello,
While building evaluation pipeline on the dashboard, I came across various textual similarity metrics, one of them being cosine similarity. Is there any information on the method used to compute the vectors for the cosine similarity calculation? Is an embedding model being used?
Thank you.

1 Like

Hi,

Welcome to the Developer forum.

I believe this is cosine similarity

(You can probably find a more appropriate video on YouTube)

https://platform.openai.com/docs/guides/embeddings

https://platform.openai.com/docs/pricing

1 Like

In the evals API endpoint and its platform UI, this is what is being discussed.

One can also visit the API Reference and see the actual graders, which have little correlation to the UI.

This post highlights a slight complete absence of any useful documentation at all.


One expect that “cosine similarity”, analogous to a normalized dot product, (wherever you might be encountering that), refers to the typical algorithm for comparing semantic similarity between two embeddings vectors.