What is the method used behind cosine similarity in the evaluation?

alexeu · May 1, 2025, 4:18pm

Hello,
While building an evaluation pipeline on the dashboard, I came across various textual similarity metrics, one of them being cosine similarity. Is there any information on the method used to compute the vectors for the cosine similarity calculation? Is an embedding model being used?
Thank you.

phyde1001 · May 1, 2025, 4:23pm

Hi,

Welcome to the Developer forum.

I believe this is cosine similarity

(You can probably find a more appropriate video on YouTube)

https://platform.openai.com/docs/guides/embeddings

https://platform.openai.com/docs/pricing

_j · May 1, 2025, 9:19pm

What is being discussed here is the evals API endpoint and its platform UI.

One can also visit the API Reference and see the actual graders, which have little correlation to the UI.

This post highlights a slight complete absence of any useful documentation at all.

One would expect that “cosine similarity” (wherever you might be encountering it), which is a dot product normalized by the lengths of the two vectors, refers to the typical algorithm for comparing semantic similarity between two embedding vectors.
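
For reference, here is a minimal sketch of the generic cosine similarity calculation on two vectors of equal length. It only illustrates the math; it is not confirmed to be what the evals grader runs, and the function name `cosine_similarity` and the toy vectors are my own, not taken from any API. In practice the inputs would be embedding vectors from whatever model is used.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between vectors a and b.

    Hypothetical helper for illustration; not an API function.
    """
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")

    # Dot product of the two vectors.
    dot = sum(x * y for x, y in zip(a, b))

    # Euclidean (L2) length of each vector.
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))

    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")

    # Normalizing the dot product by both lengths gives a value in [-1, 1].
    return dot / (norm_a * norm_b)


# Toy vectors standing in for embeddings (made-up values).
same_direction = cosine_similarity([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])
orthogonal = cosine_similarity([1.0, 0.0], [0.0, 1.0])
opposite = cosine_similarity([1.0, 0.0], [-1.0, 0.0])
```

If the vectors are already normalized to length 1, the two norms are 1 and the result is just the dot product.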
