OpenAI Evals API - text similarity grading

Hi

I have a question about using the Evals API.

For the `text_similarity` grader, which parameter in `testing_criteria` decides the passing grade?

Here is my code; all the results show `passing` even when the similarity is below 0.5:

```python
eval_create_result = client.evals.create(
    name="Similarity Check",
    metadata={
        "description": "This eval tests text similarity"
    },
    data_source_config={
        "type": "custom",
        "item_schema": Query.model_json_schema(),  # we will upload Python objects as run data
        "include_sample_schema": True
    },
    testing_criteria=[
        {
            "type": "text_similarity",
            "name": "Compare text similarity",
            "input": "{{ sample.output_text }}",
            "evaluation_metric": "cosine",
            "reference": "{{ item.text }}",
            "passing_grade": 0.8
        }
    ],
)
```
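For reference, this is the pass/fail behavior I expected: a cosine similarity score compared against a threshold. This is only a local sketch with made-up vectors, not the API's actual implementation:

```python
# Sketch of the expected grading logic (hypothetical, not the API's code):
# cosine similarity between two vectors, compared against a threshold.
import math

def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Made-up embeddings for a sample output and its reference text
sample_vec = [0.1, 0.9, 0.2]
reference_vec = [0.8, 0.1, 0.3]

similarity = cosine_similarity(sample_vec, reference_vec)
passed = similarity >= 0.8  # should fail when similarity is below the threshold
print(similarity, passed)
```

With these vectors the similarity is well below 0.8, so I would expect the item to be graded as failing, not passing.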

You might want to review the parameter names accepted by `TextSimilarityGrader`:

```
{
    "type": "text_similarity",
    "name": string,
    "input": string,
    "reference": string,
    "pass_threshold": number,
    "evaluation_metric": "cosine"
}
```

`pass_threshold` is the parameter the similarity score is compared against to produce the pass/fail boolean. Your code uses `passing_grade`, which the grader does not recognize, so no threshold is applied.
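For anyone hitting the same issue, here is the criterion from the post above rewritten with the correct parameter name (a sketch; the templates and `name` are taken from the original call):

```python
# Corrected testing criterion: `pass_threshold` instead of `passing_grade`.
criterion = {
    "type": "text_similarity",
    "name": "Compare text similarity",
    "input": "{{ sample.output_text }}",
    "evaluation_metric": "cosine",
    "reference": "{{ item.text }}",
    "pass_threshold": 0.8,  # similarity scores below this fail
}

# Then pass it as before:
# eval_create_result = client.evals.create(..., testing_criteria=[criterion])
```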

Oh @@, thanks :slight_smile:

Good to know where the grader's documentation lives.
